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Preface 

Read This First 



About This Manual 

TVP401 is a high performance graphics processor that balances high quality 
3D texturing and graphics performance with leading edge Windows, Video and 
SVGA acceleration. Based on a proven low-cost and scaleable architecture, 
TVP4010 accelerates a braod range of applications including games, 
animation, authoring, web browsers, design visualization, publishing and 
general multimedia applications. 

TVP401 sets the standard for 3D and multimedia acceleration, making it the 
ideal solution to meet the increasingly pervasive need for balanced 3D and 
multimedia acceleration-and all in a single, low cost PCI device. 

This document has been written as the primary reference for programmers 
and system designers who wish to develop software to drive the TVP4010. 
Information on programming the I/O registers can be found in the TVP4010 
Data Manual. 

An understanding of the principles of 2D and 3D graphics programming will be 
useful in reading this document. 

How to Use This Manual 

This document contains the following chapters: 

Chapter 1 gives an overview of TVP4010. 

Chapter 2 details the programming model for the chip. 

Chapter 3 describes the memory I/O and organization of TVP4010 supports 
in the framebuffer, localbuffer and texture buffer. 

Chapter 4 describes how to use TVP4010 for graphics rendering. 

Chapter 5 describes the initialization of TVP401 0. 

Chapter 6 provides tips for programming TVP4010. 
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Chapter 7 tabulates the TVP4010 registers. 

Chapter 8 gives lists of registers and their addresses. 

Appendix A gives the format used in the pseudocode examples throughout the 
document. 

Appendix B gives a table used to set up common screen widths. 
A glossary of technical terms follows the appendices. 
An extensive index is included. 



Notational Conventions 

This document uses the following conventions. 

□ Program listings, program examples, and interactive displays are shown 
in a special typeface similar to a typewriter's. Examples use a bold 
version of the special typeface for emphasis; interactive displays use a 
bold version of the special typeface to distinguish commands that you 
enter from items that the system displays (such as prompts, command 
output, error messages, etc.). 

Here is a sample program listing: 

0011 0005 0001 .field 1, 2 

0012 0005 0003 .field 3, 4 

0013 0005 0006 .field 6, 3 

0014 0006 .even 

Here is an example of a system prompt and a command that you might 
enter: 

C: csr -a /user/ti/simuboard/utilities 

□ In syntax descriptions, the instruction, command, or directive is in a bold 
typeface font and parameters are in an italic typeface. Portions of a syntax 
that are in bold should be entered as shown; portions of a syntax that are 
in italics describe the type of information that should be entered. Here is 
an example of a directive syntax: 

.asect "section name", address 

.asect is the directive. This directive has two parameters, indicated by 
section name and address. When you use .asect, the first parameter must 
be an actual section name, enclosed in double quotes; the second 
parameter must be an address. 
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□ Square brackets ( [ and ] ) identify an optional parameter. If you use an 
optional parameter, you specify the information within the brackets; you 
don't enter the brackets themselves. Here's an example of an instruction 
that has an optional parameter: 

LALK 1 6-bit constant [, shift] 

The LALK instruction has two parameters. The first parameter, 16-bit 
constant, is required. The second parameter, shift, is optional. As this 
syntax shows, if you use the optional second parameter, you must 
precede it with a comma. 

Square brackets are also used as part of the pathname specification for 
VMS pathnames; in this case, the brackets are actually part of the 
pathname (they are not optional). 

□ Braces ( {and} ) indicate a list. The symbol | (read as or) separates items 
within the list. Here's an example of a list: 

{ * I *+ I *- } 

This provides three choices: *, *+, or *-. 

Unless the list is enclosed in square brackets, you must choose one item 
from the list. 

□ Some directives can have a varying number of parameters. For example, 
the .byte directive can have up to 100 parameters. The syntax for this 
directive is: 

.byte value-i [,..., value n ] 

This syntax shows that .byte must have at least one value parameter, but 
you have the option of supplying additional value parameters, separated 
by commas. 

Information About Cautions and Warnings 

This book may contain cautions and warnings. 



This is an example of a caution statement. 

A caution statement describes a situation that could potentially 
damage your software or equipment. 
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This is an example of a warning statement. 

A warning statement describes a situation that could potentially 
cause harm to you . 



The information in a caution or a warning is provided for your protection. 
Please read each caution and warning carefully. 

Related Documentation From Texas Instruments 

The following books describe the TV401 and related support tools. To obtain 
a copy of any of these Tl documents, call the Texas Instruments Literature 
Response Center at (800) 477-8924. When ordering, please identify the book 
by its title and literature number. 

TV4010 Architecture Overview, Literature No. SLAU009 

TV4010 Installation Manual, Literature No. SLAU008 
TV4010 Data Manual, Literature No. SLAS155 
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If You Need Assistance . . . 



If you want to . . . 


Contact Texas Instruments at . 
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0120-81-0026 




International: 


81-3-3457-0972 or 






81-3-3457-0976 




Korea (Korean or English): 


82-2-551-2804 




Taiwan (Chinese or English): 


886-2-3771450 


Ask questions about Mixed 




(713) 274-2320 


Signal Processor (MSP) 


Fax: 


/7iq\ 07yi oqo/i 
(iio) iLI <\—tLoiL'\ 


product operation or report 


Fax Europe: 


+33-1-3070-1032 


suspected problems 


Email: 


4389750@mcimail.com 




World Wide Web: 


http://www.ti.com/dsps 




BBS North America: 


(713) 274-2323 8-N-1 




BBS Europe: 


+44-2-3422-3248 




320 BBS Online: 


ftp.ti.com:/mirrors/tms320bbs 






(192.94.94.53) 


Ask questions about micro- 




(713) 274-2370 


controller product operation 


Fax: 


(713) 274-4203 


or report suspected prob- 


Email: 


*H370@msg.ti.com 


lems 


World Wide Web: 


http://www.ti.com/sc/micro 




BBS: 


(713) 274-3700 8-N-1 


Request tool updates 


Software: 


(214) 638-0333 




Software fax: 


(214) 638-7742 




Hardware: 


(713) 274-2285 


Order Texas Instruments 


Literature Response Center: 


(800) 477-8924 


documentation (see Note 1) 







Make suggestions about or Email: comments@books.sc.ti.com 

report errors in document* Maj| . Texas , nstruments incorporated 

tion (see Note 2) Technical Publications Manager, MS 702 

P.O. Box 1443 

Houston, Texas 77251-1443 



Notes: 1 ) The literature number for the book is required; see the lower-right corner on the back cover. 

2) Please mention the full title of the book, the literature number from the lower-right corner of the back cover, and the 
publication date from the spine or front cover. 
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Trademarks 

3Dlabs, GLINT and Permedia are registered trademarks of 3Dlabs Inc. Ltd. 
OpenGL is a trademark of Silicon Graphics, Inc. 

Windows, Win32, Windows 95 and Windows NT are trademarks of Microsoft 
Corp. 

Macintosh, QuickDraw, and QuickDraw3D are trademarks of Apple Computer, 
Inc. 

RAMDAC is a trademark of Brooktree Corp. 
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Chapter 1 



Overview 

Chapter 1 presents a functional overview of the TVP401 operation. Subjects 
covered include external intefaces, memory subsystem, host interface, and 
task switching. 

Topic Page 
1.1 Functional Overview 1-2 
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1.1 Functional Overview 

TVP4010 is a single chip 3D graphics processor providing: 

□ 42M pixels/sec - textured, bilinear filtered with true per pixel perspective 

□ 800K polygons/sec -textured, bilinear filtered with true per pixel perspec- 
tive 

□ Balanced 3D feature set 

■ Ideal for games and pervasive 3D 

■ High quality texture mapping 

■ Smooth shading and blending 

■ Optional Z buffer 

■ Fog and depth cueing 

■ Polygon based with advanced 3D sprite handling 

□ Leading Windows acceleration 

□ Accelerated video playback 

□ Fast on-chip SVGA 

□ Optimized software drivers 

□ Register level interface 

□ Low-cost PCI design 

TVP401 supports 8 and 1 6 bit RGB 3D rendering and 8, 1 6 and 24 bit RGB 
and 8 bit color index 2D rendering. 

The TVP401 memory subsystem can hold up to 8 Mbytes of SGRAM. For vid- 
eo applications using current SGRAM technology the useful range of screen 
resolutions is up to 1600 x1200 pixels. 
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1.1.1 Memory Subsystem 

The TVP4010 provides flexible support for the memory subsystem (see 
Figure 1-1). This allows the system designer a wide choice of price/perfor- 
mance tradeoffs. 

The same physical memory holds all data used by the TVP401 0. Internally the 
data types are divided into texture, localbuffer, and framebuffer. The localbuf- 
fer holds depth and stencil data; the framebuffer holds color data for display. 

Figure 1-1. External Interfaces 



VGA 
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Bus 
Interface 



Graphics Hyperpipeline 
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Interface 



SGRAM 



Bypass 



1.1.2 Host Interface 

Conceptually, the TVP401 can be viewed as a register file. Control registers 
are primed with the information required for a primitive, and then to start the 
chip drawing, a write is made to a command register 

The TVP4010 registers can be accessed directly through the memory map. 
Registers can be accessed either individually or in groups. 
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The chip also supports a bypass route to the memory to allow direct read/write 
of pixels, and implementation of algorithms not directly supported by the 
TVP4010. 

1.1.3 Task Switching 

Where multiple applications wish to make simultaneous access to the 
TVP401 0, it is the responsibility of the software driving the chip to handle the 
loading of correct state. The TVP401 has been designed to support a number 
of different software architectures. 

□ Synchronous operation means that a new task can load its context without 
waiting for current rendering to complete. 

□ All loadable states can be read back. 

□ A Sync command is provided to flush all rendering. This can be polled or 
it can return an interrupt 

1.1.4 SVGA 

The TVP401 contains a fast VGA core. The TVP401 SVGA is used for DOS 
VGA applications and during boot time before switching to use the Graphics 
Hyperpipeline. This document does not cover VGA programming. Specific in- 
formation on the TVP401 VGA can be found in the TVP4010 Data Manual. 
VGA information, such as standard registers, is described in the Program- 
mer's Guide to the EGA, VGA and Super VGA Cards by Richards F. Ferraro. 
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Programming Model 



This chapter describes the programming model for the TVP401 0. It describes 
the interface conceptually rather than detailing specific registers and their 
exact usage. In-depth descriptions of how to program the TVP401 for specific 
drawing operations can be found in later chapters. 
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2.1 Memory Regions 

The TVP401 is divided into memory regions as shown in Table 2-1 . The re- 
gion address map is shown in Table 2-2. 



Table 2-1. Memory Regions 



Region 


Address Space 


Bytes 


Description 


Comments 


Config 


Configuration 


256 


PCI configuration 


PCI special 


Zero 


Memory 


128K 


Control registers 


relocatable 


One 


Memory 


8M 


Memory region one 


relocatable 


Two 


Memory 


8M 


Memory region two 


relocatable 


Three 


I/O 


16 


Auxiliary l/F registers 


Reserved 


Four 


Memory 


256K 


Reserved 


Reserved 


ROM 


Memory 


64K 


Expansion ROM 


relocatable 


SVGA 


Memory and I/O 




SVGA addresses 


optional and fixed 


Table 2-2. Region Address Map 


Address Range (hex) 






Description 


Byte Swap 


0000.0000 - 0000.0FFF 






Control & Status 


No 


0000.1 000- 0000.1 FFF 






Memory control 


No 


0000.2000 - 0000.2FFF 






GP FIFO access 


No 


0000.3000 - 0000.3FFF 






Video control 


No 


0000.4000 - 0000.4FFF 






RAM D AC 


No 


0000.5000 - 0000.5FFF 






Auxiliary l/F 


No 


0000.6000 - 0000.6FFF 






SVGA control 


No 


0000.7000 - 0000.7FFF 






Reserved 


No 


0000.8000 - 0000.FFFF 






GP registers 


No 


0001.0000- 0001. 0FFF 






Control & Status 


Yes 


0001.1000 - 0001.1 FFF 






Memory control 


Yes 


0001.2000- 0001. 2FFF 






GP FIFO access 


Yes 


0001.3000 - 0001. 3FFF 






Video control 


Yes 


0001.4000- 0001. 4FFF 






RAM D AC 


Yes 


0001.5000- 0001. 5FFF 






Auxiliary l/F 


Yes 


0001.6000- 0001. 6FFF 






SVGA control 


Yes 


0001.7000- 0001. 7FFF 






Reserved 


Yes 


0001.8000 - 0001. FFFF 






GP registers 


Yes 
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2.2 TVP4010 as a Register File 

The simplest way to view the interface to the TVP4010 Graphic Processor is 
as a flat block of memory-mapped registers (i.e. a register file). This register 
file appears as part of the address map for TVP401 0. 

When a TVP401 host software driver is initialized, it can map the register file 
into its address space. Each register has an associated address tag, giving its 
offset from the base of the register file (since all registers reside on a 64-bit 
boundary, the tag offset is measured in multiples of 8 bytes). The most straight- 
forward way to load a value into a register is to write the data to its mapped 
address. In reality the chip interface comprises a 32 entry deep FIFO, and each 
write to a register causes the written value and the registers address tag to be 
written as a new entry in the FIFO. 

Programming the TVP401 to draw a primitive consists of writing values to the 
appropriate registers followed by a write to a command register. This last write 
triggers the start of drawing. 

TVP401 has approximately 200 registers. All registers are 32 bits wide and 
should be 32-bit addressed. Many registers are split into bit fields, and it should 
be noted that bit is the least significant bit. 

In future chip revisions, the register file may be extended, and currently 
unused bits in certain registers may be assigned new meanings. Software 
developers should ensure that only defined registers are written to and that 
undefined bits in registers are always written as zeros. The only exception to 
this rule is that in certain registers it is convenient to allow unmasked values 
to be written to registers which hold numeric data. These fields are marked as 
"not used" in Chapter 7 and elsewhere. 

2.2.1 Register Types 

TVP401 has three main types of register: 

□ Control Registers 

□ Command Registers 

□ Internal Registers 

Control Registers are updated only by the host where the chip effectively uses 
them as read-only registers. Examples of control registers are the scissor clip 
min and max registers. Once initialized by the host, the chip only reads these 
registers to determine the scissor clip extents. Most registers are control 
registers. 
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Command Registers are those which, when written to, cause an action to 
occur. Typically, the host will initialize the appropriate control registers and 
then write to a command register to initiate drawing. Some command registers 
such as ResetPickResult or Sync do not initiate rendering. Apart from these, 
there are two types of command registers: begin-draw and continue-draw. 
Begin-draw commands cause rendering to start with those values specified by 
the control registers. Continue-draw commands cause drawing to continue 
with internal register values as they were when the previous drawing operation 
completed. Making use of continue-draw commands can significantly reduce 
the amount of data that has to be loaded into TVP401 when drawing multiple 
connected objects such as polylines. Examples of command registers include 
the Render and ContinueNewLine registers. 

For convenience, this document refers to "sending a Render command to 
TVP4010" rather than saying "the Render Command register is written to, 
which initiates drawing." 

Internal Registers are not accessible to host software. They are used internally 
by the chip to keep track of changing values. Some control registers have 
corresponding internal registers. When a begin-draw command is sent and 
before rendering starts, the internal registers are updated with the values in 
the corresponding control registers. If a continue-draw command is sent, this 
update does not happen and drawing continues with the current values in the 
internal registers. For example, if a line is being drawn then the StartXDom and 
StartY control registers specify the (x, y) coordinates of the first point in the line. 
When a begin-draw command is sent, these values are copied into internal 
registers. As the line drawing progresses, these internal registers are updated 
to contain the (x, y) coordinates of the pixel being drawn. When drawing is 
completed, the internal registers contain the (x, y) coordinates of the next point 
to be drawn. If a continue-draw command is now given, these final (x, y) 
internal values are not modified and subsequent drawing operations use these 
values. However, if a begin-draw command had been used, the internal 
registers would have been reloaded from the StartXDom and StartY registers. 

For the most part, internal registers can be ignored. It is helpful to appreciate 
that they exist in order to understand the continue-draw commands. 

2.2.2 Efficiency Issues and Register Types 

Software developers wishing to write device drivers for TVP4010 should 
become familiar with the different types of registers. Some control registers 
such as the StartXDom and StartY registers have to be updated for almost 
every primitive whereas other control registers such as those for scissor clip 
or logical ops can be updated much less frequently. Pre-loading of the 
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appropriate control registers can reduce the amount of data that has to be 
loaded into the chip for a given primitive thus improving efficiency. In addition, 
as described above, the final values in internal registers can sometimes be 
used for subsequent drawing operations. 

The tables in chapter 8 list the graphics registers according to their type, name 
and address. 
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2.3 TVP4010 I/O Interface 

There are four ways of loading the TVP4010 registers: 

□ The host writes a value to the mapped address of the register. 

□ The host writes address-tag/data pairs to the FIFO. 

□ The host writes address-tag/data pairs to the FIFO via DMA. 

□ The host writes to raw memory mapped GP FIFO addresses. 

In cases where the host writes data values directly to the chip via the register 
file, consideration has to be given to FIFO overflow (unless PCI Disconnect is 
enabled). The InFIFOSpace register indicates how many free entries remain 
in the FIFO. Before writing to any register, the host must ensure that there is 
enough space left in the FIFO. The values in this register can be read at any 
time. When using DMA, the DMAcontrollerwill automatically ensure that there 
is room in the FIFO before it performs further transfers. Thus, a buffer of any 
size up to 64K 32 bit words can be passed to the DMA controller. Details of the 
FIFO and DMA controller operation are described in subsection 2.3.2. 

2.3.1 PCI Disconnect 

The PCI bus protocol incorporates a feature known as PCI Disconnect, which 
is supported by the TVP401 0. PCI Disconnect is enabled by writing to bit zero 
of the DisconnectControl register, which is at offset 0x68 in PCI RegionO. If the 
TVP4010 is in this mode and the host processor attempts to write to the full 
FIFO, instead of the write being lost, then the TVP4010 chip asserts PCI 
Disconnect, which causes the host processor to keep retrying the write cycle 
until it succeeds. 

This feature allows faster downloading of data to the TVP401 0, since the host 
does not need to poll the InFIFOSpace register. However, care should be used 
with PCI Disconnect because the bus is effectively saturated by the host 
processor until the TVP401 frees up an entry in its FIFO. In general, this mode 
should only be used either for operations where it is known that the TVP401 
can consume data faster than the host can generate it, or where there are no 
time-critical peripherals sharing the PCI bus. 

2.3.2 FIFO Control 

The previous description in section 2.2 considered the TVP4010 interface to 
be a register file. More precisely, when a data value is written to a register, this 
value and the address tag for that register are combined and put into the FIFO 
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as a new entry. The actual register is not updated until the TVP401 processes 
this entry. In the case where the TVP4010 is busy performing a time 
consuming operation (e.g. drawing a large texture mapped polygon) and not 
draining the FIFO very quickly, it is possible for the FIFO to become full. If a 
write to a register is performed when the FIFO is full, no entry is put into the 
FIFO and that write is effectively lost. 

The input FIFO is 32 entries deep and each entry consists of a tag/data pair; 
an address word which addresses the register to be updated, followed by the 
data to be sent to the register. The InFIFOSpace register can be read to 
determine how many entries are free. The value returned by this register will 
never be greater than 32. 

An example of loading the TVP4010 registers using the FIFO follows. The 
pseudocode fills a series of rectangles. Details of the conventions used in the 
pseudocode examples can be found in Appendix B. 

It is assumed that the data to draw a single rectangle consists of eight words 
(including the Render command). 

dXDom ( 0x0 ) ; // common set-up 
dXSub (0x0) ; 
dY(l) ; 

for (i =0; i < nrects; ++i) { 
while (*InFIFOSpace < 8) 

; // wait for room 
StartXDom (rect->xl); 
StartXSub (rect->x2); 
Count (rect->y2 - rect->yl); 
YStart (rect->yl) ; 

Render (TVP4010_TRAPEZOID_PRIMITIVE) ; 

} 

For simplicity, the example above shows constant polling of InFIFOSpace. 
This is expensive in terms of bus utilization, so an improved polling loop is very 
desirable. 

To check the status of the FIFO before every write is inefficient, so it is checked 
before loading the data for each rectangle. Since the FIFO is 32 entries deep, 
a further optimization is to wait for all 32 entries to be free after every second 
rectangle. Further optimizations can be made by moving dXDom, dXSub and 
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dY outside the loop (as they are constant for each rectangle) and doing the 
FIFO wait after every third rectangle. 

The InFIFOSpace FIFO control register contains the number of entries 
currently free in the FIFO. The chip increments this register for each entry it 
removes from the FIFO and decrements it each time the host puts an entry into 
the FIFO. Before writing to the input FIFO, the user must check that there is 
sufficient space by reading the InFIFOSpace register. 

The Graphics Core (GC) FIFO interface provides a port through which both 
GC register addresses and data can be sent to the input FIFO. A range of 4 
Kbytes of host space is provided although all data may be sent through one 
address in the range. All accesses go directly to the FIFO; the range is 
provided to allow for data transfer schemes, which force the use of 
incrementing addresses. 

Note that the GC registers cannot be read through this interface. Command 
buffers to be sent to the input FIFO interface may be read directly by the 
TVP401 via the DMA controller. 

A data formatting scheme is provided to allow for multiple data words to be sent 
with one address word where adjacent or grouped registers are being written, 
or where one register is to be written many times. 

Note: 

The FIFO interface can be accessed at 32 bit boundaries. This is to allow a 
direct copy from a DMA format buffer. 

2.3.3 The DMA Interface 

Loading registers directly via the FIFO is often an inefficient way to download 
data to the TVP4010. Given that the FIFO can accommodate only a small 
number of entries, the TVP401 has to be frequently interrogated to determine 
how much space is left. Also consider situations where an API function 
requires a large amount of data to be sent to the TVP4010 . If the FIFO is 
written to directly, then a return from this function is not possible until almost 
all the data has been consumed by the TVP4010. This may take some time 
depending on the types of primitives being drawn. 

To avoid these problems, the TVP4010 provides an on-chip DMA controller 
which can be used to load data from arbitrary sized (< 64K 32-bit words) host 
buffers into the FIFO. In its simplest form, the host software has to prepare a 
host buffer containing register address tag descriptions and data values. It 
then writes the base address of this buffer to the DMAAddress register and the 
count of the number of words to transfer to the DMACount register. Writing to 
the DMACount register starts the DMA transfer and the host can now perform 
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other work. In general if the complete set of rendering commands required by 
a given call to a driver function can be loaded into a single DMA buffer, then 
the driver function can return. At the same time, the TVP401 is reading data 
from the host buffer and loading it into its FIFO. FIFO overflow never occurs 
since the DMA controller automatically waits until there is room in the FIFO 
before doing any transfers. 

The only restriction on the use of DMA control registers is that before 
attempting to reload the DMACount register the host software must wait until 
previous DMA has completed. It is valid to load the DMAAddress register while 
the previous DMA is in progress since the address is latched internally at the 
start of the DMA transfer. Many display driver functions can be implemented 
using the following skeleton structure: 

do any pre-work 

DMAAddress (address of dma_buffer) ; 
while (TRUE) { 

count = *DMACount; // note this is volatile 
if (count) { 

while (--count) 

; // wait for count to expire 
} 

else 

break; // DMA completed 
} 

copy render data into DMA buffer 
DMACount (number of words in DMA buffer) 
return 

Using DMA leaves the host free to return to the application, while in parallel, 
the TVP4010 is performing the DMA and drawing. This can increase 
performance significantly over loading a FIFO directly. In addition, some 
algorithms require that data be loaded multiple times (e.g. drawing the same 
object across multiple clipping rectangles). Since the TVP4010 DMA only 
reads the buffer data, the data can be downloaded many times simply by 
restarting the DMA. This can be very beneficial if composing the buffer data 
is a time consuming task. 

A further optimization is to use a double buffered mechanism with two DMA 
buffers. This allows the second buffer to be filled before waiting for the previous 
DMA to complete thus further improving the parallelism between host and the 
TVP401 processing. 
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do any pre-work 

get free DMA buffer and mark as in use 
put render data into this new buffer 
DMAAddress (address of new buffer) 
while (TRUE) { 

count = *DMACount; // note this is volatile 
if (count) { 

while (--count) 

; / / wait for count to expire 
} 

else 

break; // DMA completed 
} 

DMACount (number of words in new buffer) 

mark the old buffer as free 

return 

The DMA buffer format consists of a 32-bit address tag description word 
followed by one or more data words. The DMA buffer consists of one or more 
sets of these formats. The following paragraphs describe the different types 
of tag description words that can be used. 

2.3.3.1 DMA Tag Description Format 

When DMA is performed, each 32-bit tag description in the DMA buffer 
conforms to the following format. 

Figure 2-1. DMA Tag Description Format 



31 


24 


16 








8 







1 1 1 


i i i ■ i 
Count or Mask 


i i i i 

res 


i 

erved 


reserved 


■ 


Address Tag 









Mode 

= Hold tag 

1 = Increment tag 

2 = Indexed tag 

3 = Reserved 



There are three different tag addressing modes for DMA: hold, increment and 
indexed. The different DMA modes are provided to reduce the amount of data 
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that needs to be transferred, hence making better use of the available DMA 
bandwidth. Each of these modes is described in the following sections. Each 
row in the following diagrams represents a 32-bit value in the DMA buffer. The 
address tag for each register is given in Chapter 7, Graphics Register 
Reference. 

2.3.3.2 Hold Format 

address-tag with Count=n-l, Mode=0 
value 1 

value n 

The hold format is commonly used for image download by setting the SyncOn- 
HostData bit in the Render command. In this format the 32-bit tag description 
contains a tag value and a count specifying the number of data words following 
in the buffer. The DMA controller writes each of the data words to the same 
address tag. This is useful for image download where pixel data is continuous- 
ly written to the Color register. The bottom 9 bits specify the register to which 
the data should be written; the high-order 1 6 bits specify the number of data 
words (minus 1 ) which follow in the buffer and which should be written to the 
address tag (note that the 2-bit mode field for this format is zero so a given tag 
value can simply be loaded into the low order 16 bits). 

A special case of this format is where the top 1 6 bits are zero indicating that 
a single data value follows the tag (i.e. the 32-bit tag description is simply the 
address tag value itself). This allows simple DMA buffers to be constructed 
which consist of tag/data pairs. For example, to render a horizontal span 10 
pixels long starting from (2,5) the DMA buffer could look like this: 

StartXDom 
2 « 16 
StartY 
5 << 16 

StartXSubl2 << 16 
Count 

1 

Render 

(trapezoid render command) 

2.3.3.3 Increment Format 

address-tag with Count=n-l, Mode=l 
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value 1 
value n 

The increment format is similar to the hold format except that as each data 
value is loaded the address tag is incremented (the value in the DMA buffer 
is not changed; the TVP4010 updates an internal copy). Thus, this mode 
allows contiguous the TVP401 registers to be loaded by specifying a single 
32-bit tag value followed by a data word for each register. The low-order 9 bits 
specify the address tag of the first register to be loaded. The 2 bit mode field 
is set to 1 and the high-order 16 bits are set to the count (minus 1) of the 
number of registers to update. To enable use of this format, the TVP4010 
register file has been organized so that registers which are frequently loaded 
together have adjacent address tags. For example, the 8 AreaStipplePattern 
registers can be loaded as follows: 

AreaStipplePatternO , Count=7, Mode=l 
row bits 
row 1 bits 

row 7 bits 

2.3.3.4 Indexed Format 

The TVP4010 address tags are 9-bit values. For the Indexed DMA Format, 
these 9-bit values are organized into major groups and within each group there 
are up to 1 6 tags. The low-order 4 bits of a tag give its offset within the group. 
The high-order 5 bits give the major group number. See Chapter 8, for a listing 
of Register Tables, showing the individual registers with their Major Group and 
Offset. 

Figure 2-2. Indexed Register Format 

8 4 

1 1 1 1 1 1 1 1 

Major Group Offset 



The indexed register format allows up to 16 registers within a group to be 
loaded while still only specifying a single address tag description word. 

address tag with Mask, Mode=2 
value 1 



2-12 



TVP4010 I/O Interface 



value n 

If the Mode of the address tag description word is set to indexed mode, the 
high-order 16 bits are used as a mask to indicate which registers within the 
group are to be used. The bottom 4 bits of the address tag description word 
are unused. The group is specified by bits 4 to 8. Each bit in the mask is used 
to represent a unique tag within the group. If a bit is set, then the corresponding 
register will be loaded. The number of bits set in the maskdetermines the num- 
ber of data words that should be following the tag description word in the DMA 
buffer. The data is stored in order of increasing corresponding address tag. For 
example, 

0x003280F0 
value 1 
value 2 
value 3 

The Mode bits are set to 2 so this is indexed mode. The Mask field (0x0032) 
has 3 bits set so there are three data words following the tag description word. 
Bits 1 , 4 and 5 are set so the tag offsets are 1 , 4 and 5. The major group is given 
by the bits 4-8 which are OxOF (in indexed mode bits 0-3 are ignored). Thus 
the actual registers to update have address tags 0x0F1, 0x0F4 and 0x0F5. 
These are updated with value 1, value 2, and value 3 respectively. 

2.3.3.5 DMA Example 

The following pseudo-code shows the previous example of drawing a series 
of rectangles but this time using the DMA controller. This example uses a 
single DMA buffer and the simplest Hold Mode for the tag description words 
in the buffer. 

UINT32 *pbuf ; 

DMAAddress (physical address of dma_buffer) 
while (*DMACount != 0) 

; / / wait for DMA to complete 
pbuf = dma_buffer; 



*pbuf++ = TVP4010TagdXDom; 

*pbuf++ = 0; 

*pbuf++ = TVP4010TagdXSub; 

*pbuf++ = 0; 

*pbuf++ = TVP4010TagdY; 

*pbuf++ = 1 << 16; 
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(i = 0; 


i < nrects; ++i) { 


*pbuf ++ 


= TVP4010TagStartXDom; 


*pbuf ++ 


= rect->xl << 16;// Start dominant edge 


*pbuf ++ 


= TVP4010TagStartXSub 


*pbuf ++ 


= rect->x2 << 16;// Start of subordinate edge 


*pbuf ++ 


= TVP4010TagCount; 


*pbuf ++ 


= rect->y2 - rect->yl; 


*pbuf ++ 


= TVP4010TagYStart; 


*pbuf ++ 


= rect->yl << 16; 


*pbuf ++ 


= TVP4010TagRender; 


*pbuf ++ 


= TVP4010_TRAPEZOID_PRIMITIVE; 



} 



// initiate DMA 

DMACount ( (int ) (pbuf - dma_buffer)) 

The example assumes that a host buffer has been previously allocated and is 
pointed at by dma_buffer. It is worth noting that significantly less data would 
be required if indexed tags were used in this example. 

2.3.3.6 DMA Buffer Addresses 

The host software must generate the correct DMA buffer address for the 
TVP401 DMA controller. This typically means that the address passed to the 
TVP401 must be the physical address of the DMA buffer in host memory. The 
buffer must also reside at contiguous physical addresses as accessed by the 
TVP4010. On a system which uses virtual memory for the address space of 
a task, some method of allocating contiguous physical memory and mapping 
this into the address space of a task must be used. 

If the virtual memory buffer maps to non-contiguous physical memory, the 
buffer must be divided into sets of contiguous physical memory pages and 
each of these sets must be transferred separately. In this situation, the whole 
DMA buffer cannot be transferred in one go; the host software must wait for 
each set to be transferred. Often the best way to handle these fragmented 
transfers is via an interrupt handler. 

2.3.3.7 DMA Interrupts 

The TVP4010 provides interrupt support, as an alternative means of 
determining when a DMA transfer is complete. This interrupt support can 
provide a considerable speed advantage. If enabled, the interrupt is generated 
whenever the DMACount register changes from a non-zero to a zero value. 
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Since the DMACount register is decremented every time a data item is 
transferred from the DMA buffer, this happens when the last data item is 
transferred from the DMA buffer. 

To enable the DMA interrupt, the DMAInterruptEnable bit must be set in the 
IntEnable register. The interrupt handler should check the DMAFlag bit in the 
IntFlags register to determine that a DMA interrupt has actually occurred. To 
clear the interrupt a word should be written to the IntFlags register with the 
DMAFlag bit set to one. 

A typical use of DMA interrupts might be as follows: 

prepare DMA buffer 

DMACount (n) ; // start a DMA transfer 
prepare next DMA buffer 
while ( *DMACount != 0) { 
mask interrupts 

set DMA Interrupt Enable bit in IntEnable register 
sleep on interrupt handler wake up 
unmask interrupts 

} 

DMACount (n) // start the next DMA sequence 

The interrupt handler could then be 

if (*IntFlags & DMA Flag bit) { 
reset DMA Flag bit in IntFlags 
send wake up to main task 

} 

Interrupts are complicated and depend on the facilities provided by the host 
operating system. The pseudocode above only hints at the system details. 

This scheme frees the processor for other work while DMA is being completed. 
Since the overhead of handling an interrupt is often quite high for the host 
processor, the scheme should be tuned to allow a period of polling before 
sleeping on the interrupt. 

2.3.4 Output FIFO and Graphics Processor FIFO Interface 

To read data back from the TVP401 an output FIFO is provided. Each entry 
in this FIFO is 32-bits wide, and it can hold either tag or data values. Thus its 
format is unlike the input FIFO whose entries are always tag/data pairs (think 
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of each entry in the input FIFO as being 41 bits wide; 9 bits for the tag and 32 
bits for the data). The type of data written by the TVP401 to the output FIFO 
is controlled by the FilterMode register. This register allows filtering of output 
data in various categories including the following: 

□ Depth: the output in this category results from an image upload of the 
Depth buffer. 

□ Stencil: the output in this category results from an image upload of the 
Stencil buffer. 

□ Color: the output in this category results from an image upload of the 
framebuffer. 

□ Synchronization : synchronization data is sent in response to a Sync com- 
mand. 

The data for the FilterMode register consists of 2 bits per category. If the least 
significant of these two bits is set (0x1), output of the register tag for that 
category is enabled. If the most significant bit is set (0x2), output of the data 
for that category is enabled. Both the tag and data output can be enabled at 
the same time. In this case, the tag is written first to the FIFO followed by the 
data. The FilterMode register is described in more detail in Section 4.15. 

For example, to perform an image upload from the framebuffer, the FilterMode 
register should have data output enabled for the Color category. Then, the 
rectangular area to be uploaded should be described to the Rasterizer. Each 
pixel that is read from the framebuffer is placed into the output FIFO. When the 
output FIFO is full, the TVP4010 blocks internally until space becomes 
available. It is the programmer's responsibility to read all data from the output 
FIFO. For example, it is important to know how many pixels should result from 
an image upload and to read exactly this many from the FIFO. 

To read data from the output FIFO, the Output FIFO Words register should first 
be read to determine the number of entries in the FIFO (reading from the FIFO 
when it is empty returns undefined data). This is the number of 32-bit data 
items to be read from the FIFO. This procedure is repeated until all the 
expected data or tag items have been read. The address of the output FIFO 
is described below. 

All expected data must be read back. The TVP4010 will block when the output 
FIFO is full. Programmers must be careful to avoid the deadlock condition that 
results if the host is waiting for space to become free in the input FIFO while 
the TVP401 is waiting for the host to read data from the output FIFO. 

2.3.5 Graphics Processor FIFO Interface 

The TVP401 has a sequence of 1 K x 32 bit addresses in the PCI Region 
address map called the Graphics Processor FIFO Interface. To read from the 
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output FIFO, any address in this range can be read (normally a program will 
choose the first address and use this as the address for the output FIFO). All 
32-bit addresses in this region perform the same function. The range of 
addresses is provided for data transfer schemes that force the use of 
incrementing addresses. 

Writing to a location in this address range provides raw access to the input 
FIFO. Again, the first address is normally chosen. Thus, the same address can 
be used for both input and output FIFOs. Reading this location gives access 
to the output FIFO; writing gives access to the input FIFO. 

Writing to the input FIFO by this method is different from writing to the memory 
mapped register file. Since the register file has a unique address for each 
register, writing to this unique address allows the TVP4010 to determine the 
register for which the write is intended. This allows a tag/data pair to be 
constructed and inserted into the input FIFO. When writing to the raw FIFO 
address an address tag description must first be written followed by the 
associated data. In fact, the format of the tag descriptions and the data that 
follows is identical to that described above for DMA buffers. Instead of using 
the TVP4010 DMA, it is possible to transfer data to the TVP4010 by 
constructing a DMA-style buffer of data and then copying each item in this 
buffer to the raw input FIFO address. Based on the tag descriptions and data 
written the TVP4010 constructs tag/data pairs to enter as real FIFO entries. 
The DMA mechanism can be thought of as an automatic way of writing to the 
raw input FIFO address. 

Note, that when writing to the raw FIFO address the FIFO full condition must 
still be checked by reading the InFIFOSpace register. However, writing tag 
descriptions does not cause any entries to be entered into the FIFO; such a 
write simply establishes a set of tags to be paired with the subsequent data. 
Thus, free space need be ensured only for actual data items that are written 
(not the tag values). For example, in the simplest case where each tag is 
followed by a single data item and assuming that the FIFO is empty, then 32 
writes are possible before checking again for free space. 

See the TVP4010 Data Manual for more details of the Graphics Processor 
FIFO Interface address range. 
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2.4 Interrupts 

The TVP4010 provides interrupt facilities for the following: 

□ Sync: issued when a Sync command is sent with the interrupt bit set; used 
to indicate that the graphics processor is idle. Synchronization is 
described further in subsection 2.5. 

□ Error: issued under certain error conditions, such as an attempt to write 
to a full FIFO. 

□ Vertical Retrace: issued at the start of the vertical blank period. 

□ Scanline: issued at the start of a specified scanline. 

□ DMA: functions as described in subsection 2.3.3 

All interrupts can be individually enabled and disabled. Refer to the TVP4010 
Data Manual for more details. 
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2.5 Synchronization 

There are two main cases where the host must synchronize with the TVP401 0: 

□ before reading back from the TVP401 registers 

□ before directly accessing the memory via the bypass mechanism 

Also the host must synchronize with the TVP4010 for framebuffer 
management tasks such as double buffering, though this may be better 
handled using the SuspendUntilFrameBlank command. Synchronizing with 
the TVP401 implies waiting for any pending DMA to complete and waiting for 
the chip to complete any processing currently being performed. The following 
pseudo-code shows the general scheme: 

TVP4010Data data; 

// wait for DMA to complete 

while (*DMACount != 0) { 

poll or wait for interrupt 

} 

while (*InFIFOSpace < 2) { 

; // wait for free space in the FIFO 

} 

// enable sync output and send the Sync command 
data. Word = 0; 

data . FilterMode . Synchronization = 0x1; 
FilterMode (data .Word) ; 
Sync (0x0) ; 

/* wait for the sync output data */ 
do { 

while (*OutFIFOWords == 0) 

; // poll waiting for data in output FIFO 
} while (*OutputFIFO != Sync_tag) ; 

Initially, wait for DMA to complete as normal. Then wait for space to become 
free in the FIFO (since the DMA controller actually loads the FIFO). Space for 
two registers is needed: one register to enable generation of an output sync 
value, and the second register for the Sync command itself. The enable flag 
can be set at initialization time. The output value is generated only when a 
Sync command has been sent and the TVP4010 has completed all 
processing. 
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Rather than polling, it is possible to use a Sync interrupt as mentioned in the 
previous section. As well as enabling the interrupt and setting the filter mode, 
the data sent in the Sync command must have the most significant bit set in 
order to generate the interrupt. The interrupt is generated when the tag or data 
reaches the output end of the Host Out FIFO. Use of the Sync interrupt has 
to be considered carefully as the TVP401 will generally empty the FIFO more 
quickly than it takes to set up and handle the interrupt. 
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2.6 Host Memory Bypass 

Normally, the host will access memory indirectly via commands sent to the 
TVP4010 FIFO interface. However, the TVP4010 does provide the whole 
memory as part of its address space so that it can be memory mapped by an 
application. Access to the memory via this route is independent of the 
TVP4010FIFO. 

Drivers may choose to use direct access to memory for algorithms which are 
not supported by the TVP4010 or for better performance in some specific 
cases. This may be so, for example, when multiple pixels can be written 
simultaneously and there is minimal host software overhead. 

A driver using the bypass mechanism should synchronize memory accesses 
made through the FIFO with those made directly through the memory map. If 
data is written to the FIFO and then an access is made to the memory, it is 
possible that the memory access will occur before the commands in the FIFO 
have been fully processed. This lack of temporal ordering is generally 
undesirable. 

There are two windows through which the memory can be accessed. Each 
window can have its own data formatting control that allows for different forms 
of byte swapping and data packing. If the framebuffer is set to use the 
5:5:5:1 Front and 5:5:5:1 Back color modes, two pixels are packed into each 32 
bit word, but each pixel belongs to a different buffer. Adjacent pixels in the 
same buffer are separated by 16 bits. As some software has difficulty with 
pixels that are not packed together, the memory windows can be configured 
to remap the data so that only the front or back buffer is visible, and it appears 
packed. 
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2.7 Register Read Back 

Under some operating environments, multiple tasks will want access to the 
TVP401 chip. Sometimes a server task or driver will want to arbitrate access 
to the TVP401 on behalf of multiple applications. In these circumstances, the 
state of the TVP401 chip may need to be saved and restored on each context 
switch. To facilitate this, the TVP401 registers can be read back. For details 
of which registers are readable, see Appendix D, Register Tables. Internal and 
command registers cannot be read back. 

To perform a context switch the host must first synchronize with the TVP401 0. 
This means sending a Sync command and waiting for the sync output data to 
appear in the output FIFO. After this the registers can be read back. 

To read the TVP4010 register, the host reads the same address that is used 
for a write, i.e. the base address of the register file plus the offset value for the 
register. 

Note that since internal registers cannot be read back, care must be taken 
when context switching a task that is making use of continue-draw commands. 
Continue-draw commands rely on the internal registers maintaining their 
previous state. This state will be destroyed by any rendering work done by a 
new task. To prevent this, continue-draw commands should be performed via 
DMA since the context switch code has to wait for outstanding DMA to 
complete. Alternatively, continue-draw commands can be performed in a 
non-preemptable code segment. 

Normally, reading back individual registers should be avoided. The need to 
synchronize with the chip can adversely affect performance. It is usually more 
appropriate to keep a software copy of the register that is updated whenever 
the actual register is changed. 
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2.8 Byte Swapping 

Internally the TVP401 operates in little-endian mode. However, the TVP401 
is designed to work with both big-endian and little-endian host processors. 
Since the PCI Bus specification defines that byte ordering is preserved 
regardless of the size of the transfer operation , the TVP401 provides facilities 
to handle byte swapping. See the TVP401 Data Reference Manual for more 
details of byte-swapping via the PCI bus. 

Additional support is provided within the graphics core of the chip to byte swap 
images and bitmasks as they are transferred to and from the host. These 
support mechanisms are documented in chapter 4. 
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2.9 Red and Blue Swapping 

For a given graphics board the RAMDAC and/or API will usually force a given 
interpretation for true color pixel values. For example, 32-bit pixels will be 
interpreted as either RGB (red at byte 2, green at byte 1 and blue at byte 0) 
or BGR (blue at byte 2 and red at byte 0). The byte position for red and blue 
may be important for software which has been written to expect one byte order 
or the other, in particular when handling image data stored in a file. 

The TVP401 provides three registers to specify the byte positions of blue and 
red internally. In the Texture/Fog/Blend unit the AlphaBlendMode register 
contains a 1 -bit field called ColorOrder. If the color order bit is set to zero, the 
byte ordering is BGR; if the bit is set to one, the ordering is RGB. As well as 
setting this bit in the Alpha Blend unit, it must also be set in the Color Format 
unit and the Texture Read unit via the DitherMode and TextureDataFormat 
registers. 
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Memory I/O and Organization 



This section describes the arrangement of data stored in memory. Although 
the TVP4010 has a single unified memory space. For ease of reference, this 
is divided into three buffers: the localbuffer, framebuffer and texture buffer. Any 
of these buffers can be any size at any position in the memory. 

For 3D operation associated with the framebuffer there would normally be a 
localbuffer to hold depth and/or stencil information. A texture buffer may be 
present if needed. For 2D operation the localbuffer is not generally be used, 
but the texture buffer may be used to store pixmaps. 
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The TVP401 supports an optional scheme for organizing memory, known as 
"patching". Data is normally stored linearly in memory such that incrementing 
addresses move from left to right along a scanline of the appropriate buffer. 
The type of memory supported by the TVP4010 uses a page structure which 
allows fast accesses within a 2-Kbyte region, but imposes a penalty for moving 
to a new 2-Kbyte region. This page structure favors access patterns that move 
along a scanline but is inefficient for moving vertically as the large change in 
address may cause a page break. 

Patched data is organized so that there is less penalty for moving vertically in 
a buffer at the expense of a decrease in performance for moving horizontally. 
This is done by organizing memory such that a two-dimensional region or 
patch in the buffer corresponds to a linear sequence in memory. A buffer 
comprises many patches. 

Two patch modes are supported which differ in the detail of how the data is 
organized within the patch. Normal patch mode is used for localbuffer and 
framebuffer data. Subpatch mode is used for texture and framebuffer data. 
Patched data cannot be displayed, so patching of framebuffer data is normally 
only done for off-screen bitmaps or when processing localbuffer or texture data 
through the framebuffer units. 

The TVP4010 has full support for patching data that is downloaded from the 
host. If the programmer wishes to supply data that is already patched, the 
following algorithms show how to generate the XY coordinate in the source 
data for a given XY coordinate in the destination buffer. 

For normal patch data X and Y coordinate bits are combined as follows: 

X coord bit pattern = = Xo, Xi, X2, Yo, Yi, Y2, X3, X4, X5, Xg, 

Y coord bit pattern = = Y & ~0x7 

For subpatch data X and Y coordinate bits are combined as follows: 

X coord bit pattern = = 

X , Y ,Xi, Yi,X 2 , Y2,X 3 ,Y3,X 4 , Y 4 ,X 5 ,X 6 ,H 

Y coord bit pattern = = Y & ~0xlF 

If texture data is 4 bits per texel, it is loaded two texels at a time as 8-bit values. 
In this mode a different patch algorithm is used to allow for writing data as 8 
bits and reading it as 4 bits. For subpatchpack data X and Y coordinate bits 
are combined as follows, where each XY value indexes 8 bits: 

X coord bit pattern = = 

Yo, Xg, Yi, Xi, Y2, X2, Y3, X3, Y4, X4, X5, Xg, E 

Y coord bit pattern = = Y & -OxlF 
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3.2 Localbuffer 



The localbuffer holds the Depth and Stencil information corresponding to each 
displayed pixel. The Depth field can be either 1 5 or 1 6 bits wide and the Stencil 
field either 1 or bits wide. The total width of the localbuffer data cannot be 
greater than 16 bits. If a Stencil field is defined, then it occupies bit 15; the 
depth field always starts at bit 0. 

The format of the localbuffer is specified in two places: the LBReadFormat 
register and the LBWriteFormat register. 



The translation from the internal coordinate system to the external address 
map involves setting the base address of the window (or screen if coordinates 
are screen relative) and positioning the origin in either the top left or bottom 
left corner. The origin is specified in the LBReadMode register. 

The actual equations used to calculate the localbuffer address to read and 
write are: 

Bottom left origin: 

Destination address = LBWindowBase - Y * W + X 

Source address = LBWindowBase - Y * W + X + LBSour- 
ceOf f set 

Top left origin: 

Destination address = LBWindowBase + Y * W + X 

Source address = LBWindowBase + Y * W + X + LBSour- 
ceOf f set 



3.2.1 Localbuffer Coordinates 



where: 



X 



is the pixel's X coordinate, 
is the pixel's Y coordinate. 



Y 



LBWindowBase 



holds the base address in the localbuffer of the current window. 



LBSourceOffset 



is normally zero except during a copy operation where data is 
read from one address and written to another address. The 
offset between source and destination is held in the 
LBSourceOffset register. 



W 



is the screen width. Only a subset of widths are supported and 
these are encoded into the PPO, PP1 and PP2 fields in the 
LBReadModeregister. See Appendix C for more details. 
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This produces the localbuffer address in pixels. For the TVP4010, the 
localbuffer data is always 1 6 bits so the physical byte address is two times the 
pixel address. The destination address is the address that data will be written 
to; data may also be read from this address if read-modify-write operations are 
needed such as depth testing. The source address is mainly used for copy 
operations and is only used for reading data. 
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3.3 Framebuffer 

The framebuffer holds color data produced by the TVP401 0. The framebuffer 
may hold both displayed and non-displayed data. Color buffers can be placed 
anywhere in memory; there is no restriction on areas that can be displayed 
from. 

There may be several buffers, such as the front and back buffers of a double 
buffered system, or the left and right buffers of a stereo system. No restrictions 
are placed on the number or organization of the buffers other than the total 
amount of memory fitted. 

To access alternative buffers either the FBPixelOffset register can be loaded, 
or the base address of the window held in the FBWindowBase register can be 
redefined. 

3.3.1 Framebuffer Coordinates 

Coordinate generation for the framebuffer is similar to that for the localbuffer 
except for the addition of FBPixelOffset. The WindowOrigin bit in the FBRead- 
Mode register selects top left or bottom left as the origin for the framebuffer. 

The actual equations used to calculate the framebuffer address to read and 
write are: 

Bottom left origin: 

Destination address = FBWindowBase - Y * W + X + FBPixe- 
lOffset 

Source address = FBWindowBase - Y * W + X + FBPixelOffset 
+ FBSourceOf f set 

Top left origin: 

Destination address = FBWindowBase + Y * W + X + FBPixe- 
lOffset 

Source address = FBWindowBase + Y * W + X + FBPixelOffset 
+ FBSourceOf f set 
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where: 



X 


is the pixel's X coordinate, 


Y 


is the pixel's Y coordinate, 


FBWindowBase 


holds the base address in the framebuffer of the current 




window. 


rbrlXeiUTTSet 


is normally zero except when multi-buffer writes are needed 




when it gives a way to access pixels in alternative buffers with- 




out changing the FBWindowBase register. This is useful as the 




window system may be asynchronously changing the window's 




position on the screen. It is held in the FBPixelOffset register. 


FBSourceOffset 


is normally zero except during a copy operation where data is 




read from one address and written to another address. The 




FBSourceOffset is held in the FBSourceOffset register. 


W 


is the screen width. Only a subset of widths are supported and 




these are encoded into the PPO, PP1 and PP2 fields in the 




FBReadMode register. See the table in Appendix B for more 




details. 



These address calculations translate a 2D address into a linear address so 
non-power of two framebuffer widths (e.g. 640) are economical in memory. 
The address is in pixels; this is translated to a physical byte address by 
multiplying by the number of bytes in the pixel. 

The width is specified as the sum of selected partial products which are 
selected by the fields PPO, PP1 and PP2 in the FBReadMode register. This is 
the same mechanism used to set the width of the localbuffer; however, the 
widths may be set independently. The range of widths supported are tabulated 
in Appendix C, together with the values for each of the PP fields. This table 
holds all the common screen widths. 

For arbitrary screen sizes, for instance when rendering to off-screen memory 
such as bitmaps the next largest width from the table must be chosen. The 
difference between the table width and the bitmap width is the unused strip of 
pixels down the right hand side of the bitmap. 

Note that such bitmaps can be copied to the screen only as a series of 
scanlines rather than as a rectangular block, unless the Texture Read unit is 
used. In this case the stride for the read can be set differently from the write 
by means of the partial products. However, windowing systems often store 
offscreen bitmaps in rectangular regions that use the same stride as the 
screen. In this case normal bitblts can be used. 
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3.3.2 Framebuffer Color Formats 

The contents of the framebuffer can be regarded in two ways: 

□ As a collection of fields of up to 32 bits with no meaning or assumed format 
as far as the TVP401 is concerned. Bit planes may be allocated to control 
cursor, color look up tables (LUTs), multi-buffer visibility or priority 
functions. In this case the TVP401 will be used to set and clear bit planes 
quickly but not perform any color processing such as interpolation or 
dithering. All color processing can be disabled so that raw reads and writes 
are done and the only operations performed are writemasking and logical 
ops. This allows the control planes to be updated and modified as 
necessary. 

□ As a collection of one or more color components. All the processing of 
color components, except for the final writemask and logical ops are done 
using the internal color format . The final stage before writemask and 
logical ops processing converts the internal color format to that required 
by the physical configuration of the framebuffer and video logic. The range 
of supported formats are given in Table 3-1 . The nomenclature n@m 
means this component is n bits wide and starts at bit position m in the 
framebuffer. The least significant bit position is and a dash in a column 
indicates that this component does not exist for this mode. 

Some important points to note: 

□ The alpha channel, when present, is always associated with the RGB 
color channels rather than being a separate buffer. This allows it to be 
moved in parallel and to work correctly in multi-buffer updates and double 
buffering. 

□ For the Front and Back modes the data value is duplicated in both buffers. 
In general, if the data format does not take 32 bits the data is repeated in 
the empty bit planes. If the data format requires 8 bits, the same value is 
repeated in all four bytes of the word. The pixel size determines how many 
of the bytes are written to memory. If a 1 6 bit format is chosen (e.g. 
5:5:5:1), the data is repeated in the upper and lower halves of the word. 
If the pixel size is set to 1 6 bits, only half the word is written to memory; 
if the pixel size is set to 32 bits then both halves are written, with the same 
data in each. A writemask can be used to select which bits are written. This 
is used for certain types of double buffering. The front and back modes are 
used in the alpha blend unit to extract the appropriate buffer. 

□ The offset modes (1 and 1 1 ) format the colors into a 7 bit value and then 
add 64 to the result. This avoids reserved entries in window-system color 
tables. 
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□ YUV formats are only available as textures, the TVP4010 can convert 
YUV textures to RGB and apply them to polygons; it cannot convert RGB 
to YUV for storage. If a YUV texture is being loaded into the chip it should 
be done as raw data or converted to RGB as it is loaded. 

□ The CI4 format is only available as a texture. 

□ When reading the framebuffer, RGBA components are scaled to their 
internal width if needed for alpha blending. 

□ The color format of the framebuffer is independent of the color format of 
the texture buffer; the texture buffer supports the same formats as the 
framebuffer plus some for YUV color formats. 

Color information is stored as values of red, green and blue (RGB) with or with- 
out alpha values. Alternatively, it can be stored as a color index value (CI) 
where each value references an entry in a color look up table that contains 
RGB values. 

The color format information needs to be stored in three places: the Dither- 
Mode register, the AlphaBlendMode register and the TextureDataFormat 
register. 



Framebuffer 



Table 3-1. Supported Color Formats 



Internal Color Channels 





Format 


Color Order Name 


R/Y 


G/U 


B/V 


A 


BGR 





BGR 


8:8:8:8 


8@0 


8@8 


8@16 


8@24 




1 


BGR 


5:5:5:1 Front 


5@0 


5@5 


5@10 


1@15 




2 


BGR 


4:4:4:4 


4@0 


4(3)4 


4@8 


4@12 




5 


BGR 


3:3:2Front 


3@0 


3@3 


2@6 







6 


BGR 


3:3:2Back 


3@8 


3(3)11 


2@14 







9 


BGR 


2:3:2:1 Front 


2@0 


3@2 


2@5 


1@7 




10 


BGR 


2:3:2:1 Back 


2@8 


3@10 


2@13 


1@15 




11 


BGR 


2:3:2FrontOff 


2@0 


3@2 


2@5 







12 


BGR 


2:3:2BackOff 


2@8 


3@10 


2@13 







13 


BGR 


5:5:5:1 Back 


5(3)16 


5@21 


5@26 


1@31 




16 


BGR 


5:6:5Front 


5@0 


6@5 


5@11 







17 


BGR 


5:6:5Back 


5@16 


6@21 


5@27 





YUV 


18 


BGR 


YUV444 


8@0 


8@8 


8@16 


8@24 




19 


BGR 


YUV422 


8@0 


8@8 


8@8 





RGB 





RGB 


8:8:8:8 


8@16 


8@8 


8@0 


8@24 




1 


RGB 


5:5:5:1 Front 


5(3)10 


5@5 


5@0 


1@15 




2 


RGB 


4:4:4:4 


4@8 


4@4 


4@0 


4@12 




5 


RGB 


3:3:2Front 


3@5 


3@2 


2@0 







6 


RGB 


3:3:2Back 


3(3)13 


3@10 


2@8 







9 


RGB 


2:3:2:1 Front 


2@5 


3@2 


2@0 


1@7 




10 


RGB 


2:3:2:1 Back 


2@13 


3@10 


2@8 


1@15 




11 


RGB 


2:3:2FrontOff 


2@5 


3@2 


2@0 







12 


RGB 


2:3:2BackOff 


2@13 


3@10 


2@8 







13 


RGB 


5:5:5:1 Back 


5@26 


5@21 


5@16 


1@31 




16 


RGB 


5:6:5Front 


5(3)11 


6@5 


5@0 







17 


RGB 


5:6:5Back 


5(3)27 


6@21 


5@16 





YUV 


18 


RGB 


YUV444 


8(3)16 


8@8 


8@0 


8@24 




19 


RGB 


YUV422 


8(3)8 


8@8 


8@0 





CI 


14 




CI8 


8(3)0 













15 




CI4 


4(3)0 












Notes: 1 ) The DitherMode register does not support the YUV444, YUV422 or CI4 formats. 

2) The AlphaBlendMode register does not support the YUV444, YUV422 or CI4 formats. 
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3.3.3 Special Memory Modes 

The TVP4010 uses Synchronous Graphics RAM (SGRAM) to store data. 
SGRAM devices usually have special features that are particularly useful for 
graphics. 

3.3.3.1 Hardware Writemasks 

These allow writemasking in the framebuffer without incurring a performance 
penalty. If hardware writemasks are not available, the TVP4010 must be 
programmed to read the memory, merge the value with the new value using 
the writemask, and write it back. 

To use hardware writemasking, the required writemask is written to the 
FBHardwareWriteMask register, the FBSoftwareWriteMask register should 
be set to all 1s, and the number of framebuffer reads is set to (for normal 
rendering). This is achieved by clearing the ReadSource and ReadDestination 
enables in the FBReadMode register. 

To use software writemasking (if hardware masks are not available), the 
required writemask is written to the FBSoftwareWriteMask register and the 
number of framebuffer reads is set to 1 (for normal rendering). This is achieved 
by setting the ReadDestination enable in the FBReadMode register. 

3.3.3.2 Block Writes 

Block writes cause consecutive pixels in the framebuffer to be written 
simultaneously. This is useful when filling large areas but does have some 
restrictions: 

□ No depth or stencil testing can be done 

□ All the pixels must be written with the same value so no color interpolation, 
alpha blending, dithering or logical ops can be done 

Block writes are not restricted to rectangular areas and can be used for any 
trapezoid. Hardware writemasking is available during block writes, but not 
software writemasking. The scissor tests and extent checking operate 
correctly with block writes, and bitmask patterns can be applied. 

The FBBIockColor register holds the value to write to each pixel. Note that this 
register should not be updated immediately after a Render command which 
performs a block write. 

Sending a Render command with the PrimitiveType field set to "trapezoid" and 
the FastFillEnable field set will then cause block filling of the area. Note that 
during a block fill any inappropriate state is ignored so even when stippling, 
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color interpolation, depth testing and/or logical ops, for example, are enabled 
they have no effect. However, scissor clipping does function correctly with 
block writes. 

The TVP4010 always writes 32 pixels per block fill. It takes care of any partial 
blocks at the beginning or end of spans. 



Memory I/O and Organization 



3-11 



Double Buffering 



3.4 Double Buffering 

Double buffering is a technique used to achieve visually smooth animation, by 
rendering a scene to an offscreen buffer, known as the back buffer, before 
quickly displaying it. 

Which techniques are available is dependant on the board design; however, 
this section discusses how the TVP401 may be used to provide support for 
four common types of double buffering, assuming that the framebuffer 
memory and LUT-DAC have the necessary capabilities. 

□ BitBIt 

□ Full Screen 

□ Bitplane 

For further details see section 4.1 2.6, 4.1 2.7 and 4.1 4 of this manual, and refer 
to the relevant RAMDAC data sheet. 

3.4.1 BitBIt Double Buffering 

BLT double buffering in its simplest form requires a complete duplicate buffer 
of non-displayed display RAM to be maintained. To swap buffers, a BLT is 
performed to the displayable area. The features are: 

□ takes significant time to swap buffers 

□ the offscreen buffer requires as much RAM as the displayed buffer 

□ any number of windows can be independently double buffered 

□ pixel depth is limited only by the amount of available RAM. 

The BLT can be performed using the texture units to allow arbitrary scaling and 
filtering of data. 

3.4.2 Full Screen Double Buffering 

This section describes how to implement full-screen double buffering with the 
TVP4010 when using the video timing generator. To perform full-screen 
double buffering, the available display RAM must be partitioned into two parts: 
buffer and buffer 1 , each of which contains enough memory to display a full 
screen of pixel information. The partitioning consists of deciding the offset into 
RAM at which a given buffer starts. This offset is used to program various the 
TVP4010 registers. For a given resolution and pixel depth there must be 
enough RAM configured on the display adapter for this to be possible. For 
example, with 32-bit deep pixels and 4MB of RAM it is possible to implement 
full-screen double buffering at 800x600 resolution, but not at 1024x768. 
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There are two factors to consider for full-screen double buffering. Firstly, the 
video output hardware must be configured to display the pixels from the 
correct buffer. Secondly, the TVP4010 chip must be programmed to render 
into the correct buffer. To achieve smooth animations, the buffer being 
rendered into is usually different from the buffer being displayed. 

3.4.2.1 Video Output 

To display a given buffer, the video output hardware must be programmed with 
the offset of that buffer in RAM. In the TVP401 internal timing generator, this 
is controlled by the ScreenBase register located in the TVP401 control space 
at offset 0x3000. It is updated immediately when it is written, but it is not used 
by the video hardware until the start of the next frame. 

3.4.2.2 TVP4010 Rendering 

The video output hardware, restricts the position of each buffer to be on a RAS 
boundary. When determining the memory location of a pixel being rendered, 
the TVP4010 operates in screen coordinates. 

To simplify the calculation of pixel coordinates that are loaded into the 
TVP4010, this value may be loaded into the FBPixelOffset register. The last 
thing the TVP4010 does before passing a pixel address to the framebuffer 
interface is to add the value in the FBPixelOffset register to its address. Thus 
it is possible to move the rendering origin to any pixel location in memory. 
When swapping buffers it is normal to move this position to be the pixel at 
which a given buffer starts. 

These values can be pre-calculated at system start-up ready to be loaded as 
required. 

3.4.2.3 Synchronization 

Double buffering allows the displaying of one buffer (the front buffer) while 
rendering into the other (the back buffer). When the rendering has been 
completed to the back buffer, the buffers are swapped and rendering continues 
into the new back buffer. As a general rule, buffers should not be swapped until 
all rendering to the back buffer is completed so that the buffer swap does not 
result in visible tearing or screen break-up. 

The TVP4010 reads the ScreenBase register at the end of each vertical 
blanking period to determine the starting pixel for the next frame to be 
displayed. Thus, in principle, this register can be written at any time to swap 
buffers and will only take effect on the next frame. The same is not true of 
loading the FBPixelOffset register. This register gets updated as soon as the 
command to load works its way through the input FIFO. Hence, any rendering 
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that takes place after the FBPixelOffset has been loaded occurs in the new 
buffer. If care is not taken, this can result in rendering being seen before the 
buffers have been swapped. The following scheme will probably produce 
picture break-up: 

ScreenBase = BufO_Addr // display buffer 

FBPixelOffset = Bufl_Offset // draw to buffer 1 now 
Render Commands // draw next frame 

ScreenBase = Bufl_Addr // display buffer 1 

FBPixelOffset =0 // draw to buffer now 

Render Commands // draw next frame 

There are two problems here. Firstly, even though the write to the ScreenBase 
register happens immediately, the TVP4010 does not actually swap the 
buffers until the end of the next vertical blanking period. Thus the start of 
rendering of the next frame may be seen in the front buffer prior to the buffer 
swap. Secondly, once a command has been loaded into the input FIFO the 
host is free to continue with other work, while the TVP4010 executes the 
command. Accesses to the ScreenBase register bypass the FIFO so it is 
possible for the host to update it, and for the buffer swap to happen, before the 
TVP401 has completed rendering the last frame. 

The TVP4010 includes the SuspendUntilFrameBlank command to solve 
these problems without the need for the host synchronizing with the TVP401 0. 
Here is the preferred version of the above example: 

SuspendUntilFrameBlank (parameters ) // display buffer 
FBPixelOffset = Bufl_Offset // draw to buffer 1 now 
Render Commands // draw next frame 

SuspendUntilFrameBlank (parameters ) // display buffer 1 
FBPixelOffset =0 // draw to buffer now 

Render Commands // draw next frame 

The SuspendUntilFrameBlank command flushes all outstanding reads and 
writes to the framebuffer, and prevents any further framebuffer memory 
accesses until after the buffers have been swapped. 

The data that is loaded into the SuspendUntilFrameBlank command enables 
the TVP401 to swap the buffers automatically when the VBLANK occurs by 
loading a new buffer offset into the ScreenBase register as discussed 
previously. See the detailed description in the Appendix A, register reference. 

Thus, a single command register access ensures that: 

□ all rendering has completed to the back buffer 

□ the chip will wait for VBLANK before carrying out the swap 

□ the host can continue sending rendering commands to the TVP4010 
without the risk of them affecting the displayed buffer. 
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As a general performance note, it is best to send non-framebuffer related 
commands to the TVP4010 following the SuspendUntilFrameBlank 
command. For example, any commands to clear the depth buffer between 
frames should be sent as these will not affect the framebuffer and will be 
executed while the TVP401 waits for the VBLANK. This allows better overlap 
between the host and the TVP401 0. In general, any commands that will not 
cause rendering to the framebuffer to occur can be queued in the TVP4010 
FIFO before waiting on VBLANK. 

Eventually more framebuffer rendering commands are sent by the host, and 
the TVP4010 stalls its hyperpipeline until the buffer swap completes. Ideally 
the host should use this time to perform non-rendering operations, for 
example, to prepare additional DMA buffers 

Using this scheme the host will not normally need to wait for VBLANK, unless 
it is making framebuffer memory accesses through the bypass. 

To wait for VBLANK, the LineCount register can be polled. There is also a 
VBLANK interrupt available (see the TVP4010 Data Manual tor details). The 
LineCount register is reset at the start of the VBLANK period and is 
incremented by one for each scanline as the video scanner moves down the 
screen. Thus, polling for this register to have a value of less than the value held 
in the VbEnd register indicates that the TVP401 is in the VBLANK period. 

3.4.3 Bitplane Double Buffering 

Bitplane double buffering is of use at 32 bits per pixel framebuffer depth using 
32768 colors in 5:5:5:1 true-color mode. It relies on the board being designed 
with a RAMDAC which will select between the high and low 1 6 bits of its input 
stream based on whether bit 31 is set or clear. Effectively the front and back 
buffer for each pixel become interleaved within the same 32-bit word in the 
framebuffer, i.e. buffer becomes the lower 1 6 bits and buffer 1 becomes the 
upper 16 bits. 

The buffer swap is thus implemented as a block fill of bit 31 of the interior of 
a window with either one or zero. While this is not as quick as full screen double 
buffering which just requires a single register ScreenBase to be updated, it is 
many times quicker than BitBIt double buffering, and like the BitBIt case allows 
any number of windows to be hardware double buffered simultaneously. 

Note that when rendering GUI data (such as window borders, titles etc.) bit 31 
must always be set to the same value so that these pixels are always displayed 
from the same buffer. The hardware writemask can then be used to write to 
only the high, or only the low, 1 6 bits when rendering the animating contents 
of a window. 
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The features are: 

□ "almost instantaneous" buffer swap 

□ no offscreen buffer required (e.g., 1152x900 would be the maximum 
resolution on a 4MB framebuffer at 32bpp depth) 

□ multiple windows can be double buffered. GUI can write with no 
performance penalty. 

□ Only useful at 5:5:5:1 RGB color depth. 

□ No triple buffering or other advanced buffer operations 

In order to allow the Microsoft Windows 95 DIB engine to render direct to the 
framebuffer in the 5:5:5:1 format, a special framebuffer bypass option is 
supported which presents the front and back buffers uninterleaved, i.e. as a 
5:5:5:1 1 6bpp packed framebuffer. This allows rarely used complex primitives 
to be rendered by software. 

3.4.4 Panning 

Display panning can be achieved by setting the ScreenBase and ScreenStride 
registers appropriately. The ScreenBase register defines where in the frame- 
buffer the image is to start. For panning to work, the image in the framebuffer 
must be larger than that to be displayed. The ScreenStride holds this 
difference in terms of 64-bit units per scanline. For example, with a screen 
width of 640 pixels and a framebuffer image width of 660, 32-bit pixels, the 
ScreenStride needs to be set to 10. 

An effective ScreenStride of 32 bits can be achieved by the additional use of 
the HgEnd register. Appropriate setting of this register allows selection 
between odd and even 32 bits. 

To achieve a finer granularity of panning than 32 bits (four 8 bit pixels, two 1 6 
bit pixels etc) a RAMDAC with additional panning features is required. 
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3.5 Texture Buffer 

The texture buffer is very similar to the framebuffer. Textures are stored in the 
formats the framebuffer supports, and loaded into memory through the Frame- 
buffer Write unit. If the texture format is different from the framebuffer format, 
the DitherMode register should be temporarily set to the texture format during 
texture loads. Textures are read through the Texture Read unit. 

If the texture is already in the correct format, a fast texture load can be used. 
This is done by writing raw texture data to the TextureData register. Raw data 
is 32 bits wide, with the correct bit pattern to be stored in memory. No data 
formatting or packing is done, so the texture must be pre-processed if this is 
required. The texture is stored linearly in memory from the address specified 
in Texture Down LoadOff set which is automatically incremented; no patching is 
done, so if the texture is to be patched it must be done by the host. This method 
avoids setting up the Rasterizer and changing the state of the pipeline. 

3.5.1 Texture Buffer Coordinates 

Texture coordinates are formed by the Texture Address unit and passed to the 
Texture Read unit. In place of the Rasterizer's X and Y coordinate system, the 
Texture Address unit generates S and T values. 

The actual equations used to calculate the texture buffer address are: 

Bottom left origin 

Texture address = TextureBaseAddress - T * W + S 

Top left origin 

Texture address = TextureBaseAddress + T * W + S 
where: 

S is the texel's S coordinate, 

T is the texel's T coordinate, 

TextureBaseAd- holds the base address in the framebuffer of the current window, 
dress 

W is the texture map width. Only a subset of widths are supported 
and these are encoded into the PPO, PP1 and PP2 fields in the 
TextureReadMode register. See the table in Appendix C for more 
details. 

These address calculations translate a 2D address into a linear address so 
non power of two texture widths (e.g. 640) are economical in memory. Note 
that the width of the texture map used for these calculations is independent of 
the width and height used for texture effects such as repeat or clamp. The 
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address is in texels; the physical byte address is calculated by multiplying the 
texel address by the number of bytes in the texel. 

3.5.2 Texture Color Formats 

Texture maps have the same choice of formats as the framebuffer plus YUV 
and 4 bit Color Index formats (see section 3.3.2 for details). The formats of the 
texture map and framebuffer do not have to be the same. 
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The TVP4010 provides a rich variety of operations for 2D and 3D graphics 
supported by its Hyperpipelined architecture. Section 4.1 shows the units in 
the HyperPipeline, section 4.2 shows how to render a simple graphic primitive, 
and sections 4.3 to 4.15 describe each unit. 
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4.1 The Graphics HyperPipeline 

The Graphics Hyperpipeline, or Graphics Processor, supports: 

□ Point, Line, Triangle and Bitmap primitives 

□ Flat and Gouraud shading 

□ Texture Mapping, Fog and Alpha blending 

□ Scissor and Stipple 

□ Stencil test, Depth (Z) buffer test 

□ Dithering 

□ Logical Operations 

The units in the HyperPipeline are: 

□ Rasterizer scan converts the primitive into a series of fragments. 

□ Scissor/Stipple tests fragments against a scissor rectangle and a stipple 
pattern. 

□ Localbuffer Read loads localbuffer data for use in the Stencil/Depth unit. 

□ Stencil/Depth performs stencil and depth tests. 

□ Texture Address generates addresses of texels for use in the Texture 
Read unit. 

□ Texture Read accesses texture values for use in the texture application 
unit. 

□ YUV converts YUV to RGB and applies chroma test. 

□ Localbuffer Write stores localbuffer data to memory. 

□ Framebuffer Read loads data from the framebuffer. 

□ Color DDA generates color information. 

□ Texture/Fog/Blend modifies color. 

□ Color Format converts the color to the external format. 

□ Logic Ops performs logical operations. 

□ Framebuffer Write stores the color to memory. 

□ Host Out returns data to the host. 
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Figure 4-1. Hyperpipeline 
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The order of the Hyperpipeline shows the order in which operations are 
performed. The Scissor/Stipple unit is before the texture address generator, 
so any fragments that fail a stipple test will not cause a texture access. This 
makes best use of the processing capacity of the pipeline. An awareness of 
the pipeline is important when programming the TVP4010; all units in the 
pipeline can be thought of as independent. For example, enabling the XOR 
logic op will not automatically enable reading from the framebuffer; this must 
be done explicitly. 
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4.2 A Gouraud Shaded Triangle 

In this section we show how to render a typical 3D graphics primitive, the 
Gouraud shaded, depth buffered triangle using the TVP4010. For this 
example, assume the coordinate origin is bottom left of the window and 
drawing will be from top to bottom. The TVP401 can draw from top to bottom 
or bottom to top. 

Consider a triangle with vertices, vi , V2 and V3 where each vertex comprises 
X, Y and Z coordinates, shown below. Each vertex has a different color made 
up of red, green and blue (R, G and B) components. 

Figure 4-2. Example Triangle 

(X 1 ,Y 1 ,Z 1 
R r G 1 ,B 1 ) 



v1 




^3 > G 3 - B 3 ) 



The diagram makes a distinction between top and bottom halves because the 
TVP4010 is designed to rasterize screen aligned trapezoids and flat-topped 
or bottomed triangles as shown below: 

Figure 4-3. Screen aligned trapezoid and flat topped triangle 



4.2.1 Initialization 

The TVP401 requires many of its registers to be initialized in a particular way, 
regardless of what is to be drawn ; for instance, the screen size and appropriate 
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clipping must be set up. Normally this only needs to be done once and for 
clarity this example assumes that all initialization has already been done. For 
more details on initialization, see Chapter 5. 

Other state will change occasionally, though not usually on a per primitive 
basis, for instance enabling Gouraud shading and depth buffering. A detailed 
treatment is given later in this chapter; details are not included here. 



4.2.2 Dominant and Subordinate Sides of a Triangle 

The dominant side of a triangle is that side with the greatest range of Y values. 
The choice of dominant side is optional when the triangle is either flat bottomed 
or flat topped. 

The TVP401 always draws triangles starting from the dominant edge towards 
the subordinate edges. This simplifies the calculation of setup parameters as 
described in the following paragraphs. 

Figure 4-4. Dominant and Subordinate Sides of a Triangle 




4.2.3 Calculating Color values for Interpolation 



To draw from left to right and top to bottom, the color gradients (or deltas) 
required are: 

Rc±Ra G q +Gh Bo±Bi 

"^13-^ dGd yi3=i^ o^-^y] 



And from the plane equation: 

dRdx={(R r R 3 )3 (Y 2~j3> } 3 ^ r ^ 3 h 

dGdx = { ( G r G 3 ) 3 2 a 3 I ~ { ( G 2~ G 3> 3 \ 3 } 
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dBdx = {(B r B 3 ) 3 (Y 2^3> } -{(Bz-BJ 3 ^ J 3 ) 
where: 

a= ABS({(X^ X 3 )HY 2 - Y 3 )} -{(X^- X 3 )*( Y\- Y 3 ) ) 

These values allow the color of each fragment in the triangle to be determined 
by linear interpolation. For example, the red component color value of a 
fragment at X n ,Y m could be calculated by: 

adding dRdy-i3, for each scanline between Y-| and Y n , to R-|. 

then, adding dRdx for each fragment along scanline Y n from the left edge 
toX n . 

The example chosen has the knee, i.e., vertex 2, on the right hand side, and 
drawing is from left to right. If the knee were on the left side (or drawing was 
from right to left), then the Y deltas for both the subordinate sides would be 
needed to interpolate the start values for each color component (and the depth 
value) on each scanline. For this reason, the TVP401 always draws triangles 
starting from the dominant edge towards the subordinate edges. For the 
example triangle, this means left to right. 

4.2.4 Register Set Up for Color Interpolation 

For the example triangle the TVP401 registers must be set as follows. Details 
of register formats are given later. 

// Load the color start and delta values to draw 
// a triangle 
RStart (Rl) 
GStart (Gl) 
BStart (Bl) 

dRdyDom (dRdyl3) // To walk up the dominant edge 
dGdyDom (dGdyl3) 
dBdyDom (dBdyl3) 

dRdx (dRdx) // To walk along the scanline 

dGdx (dGdx) 
dBdx (dBdx) 

4.2.5 Calculating Depth Gradient Values 

To draw from left to right and top to bottom, the depth gradients (or deltas) 
required for interpolation are: 
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dZdy 13 




And from the plane equation: 



dZdx={(Z 1 -Z 3 )3 (Y ^ Y 3[ 



-}-{(Z^Z 3 )3 



( Y 1 ~Y 3 ) 



a 



where 



a = ABS{((X 1 -X 3 )i (Y2-Y 3 )}-{(X2-X 3 ) ^ (Y1-Y3))} 



The divisor, shown here as a, is the same as for color gradient values. The two 
deltas, dZdy-13 and dZdx allow the Z value of each fragment in the triangle to 
be determined by linear interpolation as described for the color interpolation 
above. 



Internally the TVP4010 uses fixed-point arithmetic. The formats for each 
register are described later. Each depth value must be converted into a 
2s-complement fixed-point number and then loaded into the appropriate pair 
of registers. The Upper or U registers store the integer portion, while the Lower 
or L registers store the fractional bits, left justified and zero filled. 

For the example triangle, the TVP4010 needs its registers set up as follows: 

// Load the depth start and delta values 

// to draw a triangle 

ZStartU (Z1_MS) 

ZStartL (Z1_LS) 

dZdyDomU (dZdyl3_MS) 

dZdyDomL (dZdyl3_LS) 

dZdxU (dZdx_MS) 

dZdxL (dZdx_LS) 



4.2.6 Register Set Up for Depth Testing 



4.2.7 Calculating the Slopes for each Side 



The TVP401 draws filled shapes such as triangles as a series of spans with 
one span per scanline. Therefore it needs to know the start and end X 
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coordinate of each span. These are determined by a process called edge 
walking. This process involves adding one delta value to the previous span's 
start X coordinate and another delta value to the previous span's end X 
coordinate to determine the X coordinates of the new span. These delta values 
are in effect the slopes of the triangle sides. To draw from left to right and top 
to bottom, the slopes of the three sides are calculated as: 

13 Y3~ Y 1 12 Y 2~ Y 1 23 Y 3~ Y 2 



This triangle will be drawn in two parts, top down to the knee i.e., vertex 2, and 
then from there to the bottom. The dominant side is the left side, so for the top 
half: 

dXDom= dX < Q dXSub = dX, 



The start X,Y, the number of scanlines, and the above deltas give the TVP401 
enough information to edge walk the top half of the triangle. However, to 
indicate that this is not a flat topped triangle (the TVP4010 is designed to 
rasterize screen aligned trapezoids and flat topped triangles), the same start 
position in terms of X must be given twice as StartXDom and StartXSub. 

To edge walk the lower half of the triangle, selected additional information is 
required. The slope of the dominant edge remains unchanged, but the 
subordinate edge slope needs to be set to: 
dXSub= dX OQ 



Also the number of scanlines to be covered from Y2 to Y3 needs to be given. 
Finally to avoid any rounding errors accumulated in edge walking to X 2 (which 
can lead to pixel errors), StartXSub must be set to X2. 

4.2.8 Rasterizer Mode 

The TVP401 Rasterizer has a number of modes which remain effective from 
the time they are set until they are modified and can thus affect many 
primitives. In the case of the Gouraud shaded triangle, the default values for 
these modes are suitable. 

Rasteri zerMode (0) // Default Rasterizer mode 

4.2.9 Subpixel Correction 

The TVP401 can perform subpixel correction of all interpolated values when 
rendering aliased trapezoids. This correction ensures that any parameter 



4-8 



A Gouraud Shaded Triangle 



(color/depth/texture/fog) is correctly sampled at the center of a fragment. In 
general, subpixel correction will always be enabled when rendering any 
trapezoid which has interpolated parameters. Control of subpixel correction is 
in the Render command register described in the following section, and is 
selectable on a per primitive basis. It does not need to be enabled for any 
primitive that does not use interpolation, including copy operations. If it is 
disabled and interpolators are used, the values calculated for the primitive may 
not be exactly correct; enabling sub-pixel correction may reduce the 
performance of the chip, particularly for small primitives. 

4.2.10 Rasterization 

The TVP401 is almost ready to draw the triangle. Setting up the registers as 
described here and sending the Render command will cause the top half of the 
example triangle to be drawn. 

For drawing the example triangle, all the bit fields within the Render command 
should be set to except the PrimitiveType which should be set to trapezoid 
and the SubPixelCorrectionEnable bit which should be set to TRUE. 

// Draw triangle with knee 
// Set deltas 

StartXDom (X 1 <<16) // Converted to 16.16 fixed point 
dXDom ( ( (X 3 - X 1 )«16)/(Y 3 - Y x )) 
StartXSub (X 1 «16) 

dXSub ( ( (X 2 - X 1 )«16)/(Y 2 - Y x )) 
StartY (Y 1 «16) 
dY (-1«16) 
Count (Y-l - Y 2 ) 

// Set the render command mode 

render .PrimitiveType = TVP4010_TRAPEZOID_PRIMITIVE 
render . SubPixelCorrectionEnable = TRUE 
// Draw the top half of the triangle 
Render (render) 

After the Render command has been issued, the registers in theTVP4010can 
immediately be altered to draw the lower half of the triangle. Note that only two 
registers need be loaded and the command ContinueNewSub sent. Once the 
TVP4010 has received ContinueNewSub, drawing of this sub-triangle will 
begin. 

// Set-up the delta and start for the new edge 
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StartXSub (X 2 <<16) 

dXSub ( ( (X 3 - X 2 )«16)/(Y 3 - Y 2 )) 
// Draw sub-triangle 

ContinueNewSub (Y 2 - Y 3 ) // Draw lower half 
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4.3 Rasterizer Unit 

The Rasterizer decomposes a given primitive into a series of fragments for 
processing by the rest of the HyperPipeline. 

The TVP4010 can directly rasterize: 

□ aliased screen aligned trapezoids 

□ aliased single pixel wide lines 

□ aliased single pixel points 

All other primitives are treated as one or more of the above. 

4.3.1 Trapezoids 

The TVP4010 basic area primitive is the screen aligned trapezoid. This is 
characterized by having top and bottom edges parallel to the X axis. The side 
edges may be vertical (a rectangle), but in general will be diagonal. The top 
or bottom edges can degenerate into points in which case we are left with 
either flat topped or flat bottomed triangles. Any polygon can be decomposed 
into screen aligned trapezoids or triangles. Usually, polygons are decomposed 
into triangles because the interpolation of values over non-triangular polygons 
is ill defined. The Rasterizer does handle flat topped and flat bottomed 'bow 
tie' polygons which are a special case of screen aligned trapezoids. 

To render a triangle, the approach adopted to determine which fragments are 
to be drawn is known as edge walking. Suppose the aliased triangle shown in 
Figure 4-5 is to be rendered from top to bottom and the origin is bottom left 
of the window. Starting at (X1 , Y1 ) then decrementing Y and using the slope 
equations for edges 1-2 and 1-3, the intersection of each edge on each 
scanline can be calculated. This results in a span of fragments per scanline 
for the top trapezoid. The same method can be used for the bottom trapezoid 
using slopes 2-3 and 1-3. 

It is usually required that adjacent triangles or polygons which share an edge 
or vertex be drawn such that pixels which make up the edge or vertex get 
drawn only once. This may be achieved by omitting the pixels down the left or 
the right sides and the pixels along the top or lower sides. The TVP401 has 
adopted the convention of omitting the pixels down the right hand edge. 
Control over whether the pixels along the top or lower sides are omitted 
depends on the start Y value and the number of scanlines to be covered. With 
the example, if StartY = Y1 and the number of scanlines is set to Y1-Y2, the 
lower edge of the top half of the triangle will be excluded. This excluded edge 
will get drawn as part of the lower half of the triangle. 
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To minimize delta calculations, triangles may be scan converted from left to 
right or from right to left. The direction depends on the dominant edge, that is 
the edge which has the maximum range of Y values. Rendering always 
proceeds from the dominant edge towards the relevant subordinate edge. In 
the example above, the dominant edge is 1-3 so rendering will be from right 
to left. 

Figure 4-5. Rasterizing a triangle. 

Subordinate Edge 1-2 (X1,Y1) 




dXSub 2-3 



The sequence of actions required to render a triangle (with a knee) are: 

■ Load the edge parameters and derivatives for the dominant edge 
and the first subordinate edges in the first triangle. 

■ Send the Render command. This starts the scan conversion of 
the first triangle, working from the dominant edge. This means 
that for triangles where the knee is on the left we are scanning 
right to left, and vice versa for triangles where the knee is on the 
right. 

■ Load the edge parameters and derivatives for the remaining 
subordinate edge in the second triangle. 

■ Send the ContinueNewSub command. This starts the scan 
conversion of the second triangle. 

Pseudocode for the above example is: 

// Set the Rasterizer mode to the default, 
// see section 4.3.10 
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RasterizerMode (0) 

// Set-up the start values and the deltas. 

// Note that the X and Y coordinates are converted to 

// 16.16 format 

StartXDom (Xl<<16) 

dXDom ( ( (X3- X1)«16)/(Y3 - Yl)) 
StartXSub (Xl«16) 
dXSub ( ( (X2- X1)«16)/(Y2 - Yl)) 
StartY (Yl<<16) 

dY (-1<<16) // Down the screen 

Count (Yl - Y2) 

// Set the render mode to aliased primitive with 
// subpixel correction. See 4.3.6 

render . PrimitiveType = TVP4010_TRAPEZOID_PRIMITIVE 
render . SubpixelCorrectionEnable = TVP4010_TRUE 
// Draw top half of the triangle 
Render (render) 

// Set the start and delta for the second half of the 
// triangle. 
StartXSub (X2<<16) 

dXSub ( ( (X3- X2)<<16)/(Y3 - Y2 ) ) 
// Draw lower half of triangle 
ContinueNewSub (abs (Y2 - Y3)) 

After the Render command has been sent, the registers in the TVP401 can 
be altered immediately to draw the second half of the triangle. To do this, note 
that only two registers need to be loaded and the command ContinueNewSub 
to be sent. Once drawing of the first triangle is complete and the TVP401 has 
received the ContinueNewSub command, drawing of this sub-triangle starts. 
The ContinueNewSub command register is loaded with the remaining number 
of scanlines to be rendered. 

A Continue command can be used instead of the ContinueNewSub command 
in certain situations where it is beneficial to avoid reloading the Rasterizer's 
edge Digital Differential Analyzers (DDAs). However, accumulation of 
rasterization errors can occur which may result in imprecise rendering. 

The ContinueNewDom command can be used to draw complex 2D shapes as 
a series of trapezoids. Since this command only affects the Rasterizer DDA 
and not any other unit, it is not suitable for 3D operations. 
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4.3.2 Lines 

Single pixel wide aliased lines are drawn using a DDA algorithm, so all thatthe 
TVP401 needs by way of input data is StartX, StartY, dX, dY and length. The 
algorithm calculates: 

while (length--) 
{ 

X = X + dx 
Y = Y + dy 

plot ((int)X, (int)Y) 

} 

Consider rendering a two segment polyline (see Figure 4-6) from (X-,, Y-,) to 
(X 2 , Y 2 ) to (X 3 , Y 3 ) 

Both segments are X major so: 

abs (X n+1 -X n ) > abs (Y n+1 - Y n ) 

The pseudocode to render this line is shown below. 

Figure 4-6. Polyline 

(X2, Y2) 

(X1,Y1)" (X3,Y3) 

// Set the Rasterizer mode to the default, 
// see section 4.3.10 
RasterizerMode (0) 

// Load the delta values for the first segment. 

StartXDom (X 1 <<16) 

dXDom (1.0«16) 

StartY (Y 1 <<16) 

dY ( ( (Y 2 - Y 1 )«16) / (X 2 - X x ) ) 

Count (abs (X 2 - X x ) ) 

/ / Set the render mode 

render . Pr imit iveType = TVP4010_LINE_PRIMITIVE 
// Start rendering 
Render (render) 

// The first segment is complete, load delta 
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1 1 for the second 

dXDom (1.0<<16) 

dY ( ( (Y 3 - Y 2 )«16) / (X 3 - X 2 ) ) 

// Continue with the second segment 

ContinueNewLine (abs (X3 - X 2 )) 

Note that the mechanism to render the second segment with the Continue- 
NewLine command is analogous to the ContinueNewSub command used at 
the knee of a triangle. Care must be taken when a continue command is being 
used for lines. Incorrect rendering can occur with operations such as alpha 
blending and logical ops if a segment draws back over the previous line seg- 
ment thus attempting to reuse pixels that have just been updated. The solution 
is to send a Sync prior to the ContinueNewLine. This will ensure pending writes 
are flushed before the framebuffer reads the new line segment. Note that there 
is no need to poll for the Sync here; the act of loading this command register 
is sufficient. 

When a Continue command is used rather than a ContinueNewLine, some 
error will be propagated along the line so this is rarely used for lines. To 
minimize these errors, a choice of actions are available as to how the DDA 
units are restarted on the receipt of a ContinueNewLine command, see 
subsection 4.3.10. 

It is recommended that for OpenGL rendering, the ContinueNewLine 
command is not used and individual segments are rendered. 

4.3.3 Points 

The TVP401 supports a single pixel aliased point primitive. For points larger 
than one pixel trapezoids should be used. The fields in the Render command 
register are described in detail later; however, in this case the Primitive Type 
field in the Render command should be set to equal the 
TVP401 0_POINT_PRIMITIVE. The pseudocode portion to render an aliased 
unity sized point is: 

// Set the Rasterizer mode to the default, 
// See section 4.3.10 
RasterizerMode (0) 

// Set-up the start values and the deltas. 

// Note that the X and Y coordinates are converted to 

// 16.16 format 

StartXDom (X<<16) 
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StartY (Y«16) 

// Set-up the render command. 

render . Pr imit iveType = TVP 4 1 0_POINT_PRIMITIVE 
/ / Render the point 
Render (render) 

4.3.4 Spans 

Shapes more complex than points, lines or trapezoids may be drawn as a 
series of spans. Each span may be drawn as a horizontal line or as a single 
pixel high trapezoid. Both are special cases of subsections 4.3.2 and 4.3.3 in 
that the loading of certain registers may be omitted e.g., dXDom, dXSub and 
dY. However, trapezoids can optionally use block writes for constant color 
spans and so may be preferable. 

4.3.5 Block Write Operation 

The TVP401 supports SGRAM block writes with block sizes of 32 pixels. Any 
screen aligned trapezoid can be filled using block writes, not just rectangles. 
The SGRAM hardware writemasks can be used in conjunction with block 
writes. 

The use of block writes is enabled by setting the FastFillEnable field in the 
Render command register. 

Note only the Rasterizer and Framebuffer Write units are involved in block 
filling. The other units will ignore block write fragments, so it is not necessary 
to disable them. 

4.3.6 Sub Pixel Precision and Correction 

As the Rasterizer has fractional precision of 1 4 bits in Y and 1 5 bits in X, and 
the maximum screen width is 1536 pixels wide a number of bits, called 
subpixel precision bits, are available. The extra bits are required for a number 
of reasons: 

■ when using an accumulation buffer (where scans are rendered 
multiple times with jittered input vertices) 

■ for correct interpolation of parameters to give high quality shading as 
described below 

The TVP4010 supports subpixel correction of interpolated values when 
rendering trapezoids. Subpixel correction ensures that all interpolated 
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parameters associated with a fragment (color, depth, fog, texture) are correctly 
sampled at the fragment's center. This correction is required to ensure 
consistent shading of objects made from many primitives. It should generally 
be enabled for all rendering which uses interpolated parameters. 

4.3.7 Bitmaps 

A Bitmap primitive is a trapezoid or line of ones and zeros which control which 
fragments are generated by the Rasterizer. Only fragments where the 
corresponding Bitmap bit is set are submitted for drawing. The normal use for 
this is in drawing characters, although the mechanism is available for all 
primitives. The Bitmap data is packed contiguously into 32-bit words so that 
rows are packed adjacent to each other. Bits in the mask word are by default 
used from the least significant end towards the most significant end and are 
applied to pixels in the order they are generated in. The relationship between 
bits in the mask and the scanning order is shown in Figure 4-7. 

Instead of rejecting fragments which fail the bitmask, they may be set to the 
background color. This is controlled by the RasterizerMode register. The 
background color comes from the TexelO register, which may be static or 
dynamically loaded through the Texture Read unit. 

The Rasterizer scans through the bits in each word of the Bitmap data and 
increments the X,Y coordinates to trace out the rectangle of the given width 
and height. By default, any set bits (1) in the Bitmap cause a fragment to be 
generated, any reset bits (0) cause the fragment to be rejected. 

Figure 4-7. Relationship between Bitmask and Scanning Directions 
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The selection of bits from the BitMaskPattern register can be mirrored, that is, 
the pattern is traversed from MSB to LSB rather than LSB to MSB. Also, the 
sense of the test can be reversed such that a set bit causes a fragment to be 
rejected and vice versa. This control is found in the RasterizerMode register, 
described in subsection 4.3.10. 
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When one Bitmap word has been exhausted and pixels in the rectangle still 
remain then rasterization is suspended until the next write to the 
BitMaskPattern register, or the bitmask can be reused. If the bitmask is still 
valid when a new line is started it can continue to the next line or be discarded 
and a new one started; the start position of the mask can be specified to allow 
the first bits to be ignored. It is also possible to index into the mask using the 
X position of the Rasterizer. This allows 32-bit wide window aligned bit pattern; 
used with a new mask for every scanline a 32 x 32 stipple pattern can be 
supported. 

For example a five-pixel wide, eight-pixel high bitmap requires a register set 
up as follows: 

// Set the Rasterizer mode to the default, 
// see section 4.3.10 
RasterizerMode (0) 

// Set-up the start values and the deltas. 

/ / Note that the X and Y coordinates are converted to 

// 16.16 format 

StartXDom (X<<16) 

dXDom (0) 

StartXSub ((X + 5)<<16) // Right hand edge pixels 

// get missed off. 

StartY (Y«16) 
dY (1«16) 
Count (8) 

// At least the following bits reguire setting for the 
// Render command. 

render . PrimitiveType = TVP4010_TRAPEZOID_PRIMITIVE 

render . SyncOnBitMask = TVP4010_TRUE 

render . ReuseBitMask = TVP4010_FALSE 

// Issue render command. First fragment will be 

/ / generated on receipt of the BitMaskPattern 

Render (render) 

// 8x5 pixel bitmap requires 40 bits, and so 2 
// 32 bit words. 
BitMaskPattern (patternWordO ) 
BitMaskPattern (patternWordl ) 
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Rendering starts as soon as the first patternWord is loaded into the BitMask- 
Pattern register. 

4.3.8 Block Writes and Bitmaps 

The fastest way to render downloaded bitmap data, not requiring logical op 
processing, is to use block fills. The Rasterizer is set up as normal, setting the 
FastFillEnable bit. If it is also necessary to plot the background color then, the 
operation should be repeated for the background color but with the InvertBit- 
Mask bit set in the RasterizerMode register. 

Since the downloaded bitmask data will be ANDed with masks generated by 
the Rasterizer without any re-alignment being performed, it is up to the host 
software to ensure that the masks match up. This can be achieved in two ways. 
First, the host software can align the bits that it downloads to match the 
alignment of the Rasterizer. A faster way is to use the User Scissor. This is the 
recommended method. Note that this is a general algorithm. In the special 
case where the data to be downloaded is already aligned to 32 bits on both the 
left and right edges, the scissor need not be used. 

For example, suppose that we want to download data to fill a rectangle with 
left edge at 1 and right edge at 200. And further, assume that the host bitmap 
data is to be loaded from an offset of 35 within the bitmap. Our goal is to match 
the bit at offset 35 with the pixel at offset 10. 

Since we want to do the least amount of work on the host by avoiding shifting 
the data, we will actually download the host bitmap data at the previous 32-bit 
boundary. This means that we must setup the TVP4010 to discard the first 
three bits of data. We achieve this by rasterizing a rectangle whose left edge 
is three pixels less than that required, in this case we would rasterize the left 
edge to start at pixel 7. This causes the source bitmap data to be correctly 
aligned with the mask data produced by the Rasterizer. But, in order to protect 
the three pixels that we would otherwise overwrite, we use the scissor clip and 
set its bounds to be those of the original rectangle. 

When using a block write operation like this, the Rasterizer will wait for new 
bitmask data to be downloaded at the start of each scanline. So we do not have 
to perform the alignment operation on the right hand edge. 

A similar algorithm can be used to implement fast text rendering. For example, 
for fonts where each line fits into 32 bits, each line of a glyph can be down- 
loaded as a mask. 

Block writes can be used in combination with bitmasks with InvertBitMask and/ 
or MirrorBitMask options but not BitMaskOffset or BitMaskPacking. 
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4.3.9 Copy/Upload/Download 

The TVP4010 supports three pixel-rectangle operations: copy, upload and 
download. These can apply to all buffer types. 

Typically, a the TVP4010 copy moves raw blocks of data around buffers. To 
zoom or re-format data, either external software must upload the data, process 
it and then download it again, or the texture part of the Texture/Fog/Blend unit 
should be used. 

To copy a rectangular area, the Rasterizer would be configured to render the 
destination rectangle, thus generating fragments for the area to be copied. The 
TVP4010 copy works by adding a linear offset to the destination fragment 
address to find the source fragment address. The calculation of the offset 
value is as shown in Figure 4-8. 

Note that the offset is independent of the origin of the buffer or window, as it 
is added to the destination address. Care must be taken when the source and 
destination overlap to choose the source scanning direction so that the over- 
lapping area is not overwritten before it has been moved. This may be done 
by swapping the values written to the StartXDom and StartXSub, or by 
changing the sign of dY and setting StartY to be the opposite side of the 
rectangle. 

Figure 4-8. TVP4010 Copy Operation 



Screen Width 




Offset = Y Offset * Screen Width + X Offset 
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The TVP401 buffer upload/downloads are very similar to copies in that the 
region of interest is generated in the Rasterizer. However, the localbuffer and 
framebuffer are generally configured to read or to write only, rather than both 
read and write. The host out unit should be set to output data to the FIFO for 
image uploads. For downloads, the Rasterizer should be set to sync on the 
appropriate data type. This means that the Rasterizer will not generate the 
next fragment address until data is supplied from the host processor. 

Units which can generate fragment values, the Color DDA unit for example, 
should generally be disabled for any copy/upload/download operations. 

Warning: During image upload, all the returned fragments must be read from the Host Out FIFO, 
otherwise the TVP4010 pipeline will stall. In addition it is strongly recommended that any units 
which can discard fragments (for instance the following tests: bitmask, user scissor, screen 
scissor, stipple, depth, stencil), are disabled otherwise a shortfall in pixels returned may occur, 
also leading to deadlock. 

Note that because the area of interest in copy/upload/download operations is 
defined by the Rasterizer, it is not limited to rectangular regions. 

Color formatting can be used when performing image copies, uploads and 
downloads. This allows data to be formatted from, or to any of the supported 
the TVP401 color formats; subsection 4.1 2.6 fully describes this operation. 

4.3.10 Rasterizer Mode 

A number of long-term modes can be set using the RasterizerMode register, 
these are: 

□ Mirror BitMask: This is a single bit flag which specifies the direction that 
bits are checked in the BitMaskPattern register. If the bit is reset, the 
direction is from least significant to most significant (bit to bit 31 ), if the 
bit is set, it is from most significant to least significant (from bit 31 to bit 0). 

□ Invert BitMask: This is a single bit which controls the sense of the 
accept/reject test when using a Bitmask. If the bit is reset then when the 
BitMask bit is set the fragment is accepted and when it is reset the 
fragment is rejected. When the bit is set the sense of the test is reversed. 

□ Fraction Adjust: These two bits control the action taken by the Rasterizer 
on receiving a ContinueNewLine command. As the TVP401 uses a DDA 
algorithm to render lines, an error accumulates in the DDA value. The 
TVP4010 provides for greater control of the error by doing one of the 
following: 

■ leaving the DDA running, which means errors will be propagated 
along a line. 
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■ or setting the fraction bits to either zero, a half or almost a half 
(0x7FFF). 

□ Bias Coordinates: Only the integer portion of the values in the DDAs are 
used to generate fragment addresses. Often the actual action required is 
a rounding of values. This can be achieved by setting the bias coordinate 
bit to true which will automatically add almost a half (0x7FFF) to all input 
coordinates. 

□ ForceBackgroundColor: When set, if a fragment fails the bitmask test it is 
not discarded, but it is made to use the contents of the TexelO register in 
place of the normal color. This is used to provide foreground/background 
color selection. 

□ BitMaskByteSwapMode. This controls how or whether the bitmask is byte 
swapped as it is loaded. Four different byte orders are supported. 

□ BitMaskPacking. Controls whether a bitmask is discarded at the end of a 
scanline or continued onto the next. Not supported for block writes. 

□ BitMaskOffset. Sets the position of the first bit in the bitmask to test. Not 
supported for block writes. 

□ HostDataByteSwapMode. Controls byte swapping of host data being sent 
to the chip. This applies to any operation using the SyncOnHostData in the 
Render register. Four different byte orders are supported. 

□ LimitsEnable. When enabled, this allows quick rejection of fragments 
outside the defined area. 

□ BitMaskRelative. If enabled, this specifies that the bitmask should be 
accessed by an index made up of the lower 5 bits of the X coordinate of 
the current fragment. 

4.3.11 Synchronization 

For most circumstances the TVP4010 automatically synchronizes between 
primitives so that data for the first primitive is written before data for the second 
primitive is read. This is handled by data type, so localbuffer reads and writes 
are synchronized as are framebuffer reads and writes, but localbuffer reads 
are not synchronized with framebuffer writes. 

If a unit is used to modify data that is not its normal type, then it may be 
necessary to explicitly synchronize the pipeline. If the Framebuffer Write unit 
is used to clear the localbuffer with block fills then the pipeline must be 
synchronized before localbuffer data is read. If the Framebuffer Write unit is 
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used to download a texture map, the pipeline must be synchronized before the 
Texture Read unit accesses the texture. 

Explicit synchronization of the pipeline is done by the WaitForCompletion 
command. This has no data field, and may be inserted into a stream of 
commands; there is no need to wait for the TVP4010 to report that 
synchronization has taken place. 

Alternatively, synchronization must be done with the Sync command, but this 
requires the host processor to poll the chip until it reports that the pipeline is 
idle (see section 4.15, Host Out Unit). 

4.3.12 X and Y limits clipping 

The Rasterizer will normally rasterize all pixels on every scanline, generating 
a fragment per pixel. If large numbers of scanlines are subsequently clipped 
out by, for example, the scissor unit, then a lot of time can be wasted. The Y 
limits register has been added to provide a way of quickly eliminating whole 
scanlines for a given primitive. This register effectively provides a Y scissor clip 
in the Rasterizer. 

If limits testing has been enabled in the RasterizerMode register, and if a 
scanline being rasterized falls outside the Y limits bounds, then the Rasterizer 
will move directly onto the next scanline without rasterizing in X. 

The Xlimits register has been added to avoid unnecessary rasterization, but 
does not act as a true X scissor clip. This is to ensure correct interpolation of 
color, fog etc. The limits registers are provided for efficiency reasons. 

Both X and Y Limits clipping are automatically disabled when 
SyncOnHostData or SyncOnBitMask is used. 

4.3.13 Registers 

Real coordinates with fractional parts are provided to the Rasterizer in 2s 
complement fixed point. The point is kept consistent with a 16.16 format even 
though some of the integer and fractional bits may not be significant. The 
integer portion should be sign extended to fill unused bits; unused bits in the 
fraction should be set to zero. 

Figure 4-9. Real Coordinate Representation 
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When reference is made to Signed-Fixed-Point Format, the sign bit is included 
in the integer section. For example, a signed-fixed-point format of 12.15 
implies 1 sign bit followed by 1 1 integer bits and 1 5 fraction bits. 

Table 4-1. Rasterizer Command Registers 



Register Name Data Field Description 



Render 


See below 


Starts the rasterization process 


ContinueNewDom 


11 bit integer 


Allows the rasterization to continue with a new dominant 
edge. The dominant edge DDA is reloaded with the new 
parameters. The subordinate edge is carried on from the 
previous trapezoid. This allows any convex polygon to be 
broken down into a collection of trapezoids, with continuity 
maintained across boundaries. Since this command only 
affects the Rasterizer DDA and not that of any other units, it is 
not suitable for 3D operations. 

The data field holds the number of scanlines to fill. Note this 
count does not get loaded into the Count register. 


ContinueNewSub 


11 bit integer 


Allows the rasterization to continue with a new subordinate 
edge. The subordinate DDA is reloaded with the new 
parameters. The dominant edge is carried on from the 
previous trapezoid. This is useful when scan converting 
triangles with a 'knee' (i.e. two subordinate edges). 
The data field holds the number of scanlines to fill. Note this 
count does not get loaded into the Count register. 


Continue 


11 bit integer 


Allows the rasterization to continue after new delta value(s) 
have been loaded, but does not cause either of the primitive's 
edge DDAs to be reloaded. This can result in the 
accumulation of rasterization errors causing imprecise 
rendering. 

The data field holds the number of scanlines to fill. Note this 
count does not get loaded into the Count register. 


ContinueNewLine 


11 bit integer 


Allows the rasterization to continue for the next segment in a 
polyline. The XY position is carried on from the previous line, 
however the fraction bits in the DDAs can be: kept, set to 
zero, half, or nearly one half, under control of the Rasterizer- 
Mode. 

The data field holds the number of pixels in a line. Note this 
count does not get loaded into the Count register. 
The use of ContinueNewLine is not recommended for 
OpenGL because the DDA units will start with a slight error as 
compared with the value they would have been loaded with 
for the second and subsequent segments. 
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Table 4-1. Rasterizer Command Registers(Continued) 
Register Name Data Field Description 

WaitForCompletion Not used This is used to suspend the TVP4010 core until all 

outstanding reads and writes in framebuffer memory units 
have completed. This is intended to prevent a new primitive 
from starting to be rasterized before the previous primitive is 
completely finished. It would be used, for example, to 
separate texture downloads from the surrounding primitives. 
The same functionality can be achieved using the Sync 
command and waiting for it in the Host Out FIFO. However, 
using WaitForCompletion doesn't involve the host and can be 
inserted into a DMA buffer. 
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Table 4-2. Rasterizer Control Registers 



RasterizerMode 



See below 



Defines the long term mode of operation of the Rasterizer. 



StartXDom 



dXDom 



StartXSub 
dXSub 
StartY 
dY 

Count 
XLimits 



YLimits 



Signed fixed point 
12.15 format 

Signed fixed point 
12.15 format 



Signed fixed point 
12.15 format 

Signed fixed point 
12.15 format 

Signed fixed point 
11 .14 format 

Signed fixed point 
11 .14 format 

11 bit integer 

Xmax: 2s comple- 
ment 12 bit value in 
the upper word. 
Xmin: 2s comple- 
ment 12 bit value in 
the lower word. 

Ymax: 2s comple- 
ment 12 bit value in 
the upper word. 
Ymin: 2s comple- 
ment 12 bit value in 
the lower word. 



Initial X value for the dominant edge in trapezoid filling, or initial 
X value in line drawing. 

Value added when moving from one scanline to the next for the 
dominant edge in trapezoid filling. 

Also holds the change in X when plotting lines so for Y major 
lines this will be some fraction (dx/dy), otherwise it is normally ± 
1 .0, depending on the required scanning direction. 

Initial X value for the subordinate edge. 

Value added when moving from one scanline to the next for the 
subordinate edge in trapezoid filling. 

Initial scanline in trapezoid filling, or initial Y position for line 
drawing. 

Value added to Y to move from one scanline to the next. For X 
major lines this will be some fraction (dy/dx), otherwise it is 
normally ± 1.0, depending on the required scanning direction. 

Number of pixels in a line. Number of scanlines in a trapezoid. 

Defines the X extents that the Rasterizer should fill between. A 
span is rasterized if its X value satisfies: 
Xmin £ X < Xmax 



Defines the Y extents that the Rasterizer should fill between, 
scanline is filled if its Y value satisfies: 
Ymin £ Y < Ymax 
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For efficiency, the Render command register has a number of bit fields that can 
be set or cleared per render operation, and which qualify other state 
information within the TVP4010. These bits are AreaStippleEnable, 
TextureEnable, FogEnable, ReuseBitMask and SubpixelCorrection. 

One use of this feature can occur when a window is cleared to a background 
color. For normal 3D primitives, stippling and fog operations may have been 
enabled, but these are to be ignored for window clears. Say that initially the 
FogMode and AreaStippleMode registers are enabled through the unit Enable 
bits. Now bits need only be set or cleared within the Render command to 
achieve the required result, removing the need to load the FogMode and 
AreaStippleMode registers for every Render operation. 

The bit fields of the Render command register are shown in Table 4-3: 
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Table 4-3. Render Command Register Fields 
Bit No. Name Description 

AreaStippleEnable Enables area stippling of the fragments produced during rasterization. 

The Stipple Unit must be enabled as well for stippling to occur. When 
this bit is reset no area stippling occurs irrespective of the setting of 
the area stipple enable bit in the Stipple Unit. 

1 ,2 Reserved 

3 FastFillEnable Causes fast block filling of primitives. When reset the normal 

rasterization process occurs. 

4, 5 Reserved 

6, 7 PrimitiveType Selects the primitive type to rasterize: 

= Line 

1 = Trapezoid 

2 = Point 

8,9,10 Reserved 

11 SyncOnBitMask Applies the bitmask test. If ReuseBitMask is disabled, the Rasterizer 

will wait for a new mask when the current one expires. If any other 
register is written while the rasterization is suspended, then the 
rasterization operation is aborted. The register write which caused the 
abort is then processed as normal. 

The behavior is slightly different when the SyncOnHostData bit is set 
to prevent a deadlock from occurring. In this case the rasterization 
doesn't suspend when all the bits have been used and if new 
BitMaskPattern data words are not received in a timely manner then 
the subsequent fragments will just reuse the bitmask. 

When this bit is set a fragment is produced only when one of the 
following registers has been written by the host: Depth, FBData, 
FBSourceData, Stencil, Color or TexelO. If SyncOnBitMask is reset, 
then if any register other than one of these six is written to, the 
rasterization operation is aborted. If SyncOnBitMask is set then if any 
register other than one of these six, or BitMaskPattern is written to, the 
rasterization is aborted. The register write which caused the abort is 
then processed as normal. Writing to the BitMaskPattern register 
doesn't cause any fragments to be generated. 

Enables texturing of the fragments produced during rasterization. 
Note that the texture units must also be enabled for any texturing to 
occur. 

14 FogEnable Enables fogging of the fragments produced during rasterization. Note 

that the fog unit must be enabled as well for any fogging to occur. 

1 5 Reserved 



12 SyncOnHostData 



13 TextureEnable 
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Table 4-3. Render Command Register Fields(Continued) 



Bit No. Name 



Description 



16 SubPixelCorrection 
Enable 



17 ReuseBitMask 



Enables the sub pixel correction of the color, depth, fog and texture 
values at the start of a scanline span. When this bit is reset no 
correction is done at the start of a span. Sub pixel corrections are only 
applied to aliased trapezoids. 

Causes the Rasterizer to reuse the current bitmask when it expires if 
SyncOnBitMask is enabled. 
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Several long-term Rasterizer modes are stored in the RasterizerMode register 
as shown in Table 4-4: 



Table 4-4. Rasterizer Mode Register 



Bit No. Name 



Description 







1 

2,3 



4,5 



6 

7,8 



10.. 14 
15,16 



MirrorBitMask 



InvertBitMask 
FractionAdjust 



BiasCoordinates 



ForceBackgroundColor 
BitMaskByteSwapMode 



BitMaskPacking 



BitMaskOffset 



When this bit is set the bitmask bits are consumed from the most 
significant end towards the least significant end. 
When this bit is reset the bitmask bits are consumed from the least 
significant end towards the most significant end. 

When this bit is set the bitmask is inverted first before being tested. 

These bits are for the ContinueNewLine command and specify how 

the fraction bits in the Y and XDom DDAs are adjusted: 

0: No adjustment is done 

1 : Set the fraction bits to zero 

2: Set the fraction bits to half 

3: Set the fraction to nearly half, i.e. 0x7fff 

These bits control how much is added onto the StartXDom, StartXSub 
and StartY values when they are loaded into the DDA units. The 
original registers are not affected: 
0: Zero is added 
1 : Half is added 

2: Nearly half, i.e. 0x7fff is added 

This bit, when set, causes the color to be taken from the TexelO 
register instead of the normal color if the bitmask test fails. 

Controls byte swapping of the bitmask. If input is ABCD, 

0:ABCD 

1 : BADC 

2: CDAB 

3: DCBA 

If enabled, the current bitmask is discarded at the end of every 
scanline even if it has not been finished. 
0: Enabled 
1 : Disabled 

Position of first bit to test in bitmask. 



HostdataByteSwapMode Controls byte swapping of host data. If input is ABCD, 
0: ABCD 
1 : BADC 
2: CDAB 
3: DCBA 



17 



Reserved 
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Table 4-4. Rasterizer Mode Register (Continued) 



Bit No. 


Name 


Description 


18 


LimitsEnable 


If enabled, quickly reject areas of primitive outside defined area. 






0: Enabled 






1 : Disabled 


19 


BitMaskRelative 


Controls whether bitmask is indexed by counter or by lower 5 bits of X 






value. 






0: Disabled 






1 : Enabled 



The register BitMaskPattern simply holds the 32-bit mask for bit mask stippling. 
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4.4 Scissor/Stipple Unit 

Two scissor tests are provided in the TVP401 0: the User Scissor test and the 
Screen Scissor test. The user scissor checks each fragment against a user 
supplied scissor region ; the screen scissor checks that the fragment lies within 
the screen. The stipple test checks each fragment against an 8 x 8 pattern. 

4.4.1 User Scissor Test 

The user scissor test, tests each fragment as follows: 

XMin <= X < XMax 
YMin <= Y < YMax 

Where X and Y are the coordinates for the fragments, and XMin, XMax, YMin 
and YMax define the user supplied scissor region. When a fragment fails the 
test, it is discarded. The test may be screen or window relative. This test 
applies to normal pixels and block fill operations. 

4.4.2 Screen Scissor Tests 

This test ensures that a pixel lies within the screen boundaries. For each 
fragment the XY origin stored in the WindowOrigin register is added to the 
fragment coordinates and this is tested against the screen boundaries stored 
in the ScreenSize register. Since the X and Y coordinates are held as 2s 
complement numbers, the window origin can be moved off the edges of the 
screen. 



The following test is made: 




<= (X + WX) < sw 




<= (Y + WY) < SH 




Where: 




X = Fragment X coordinate 


WX = Window origin X 
coordinate 


Y = Fragment Y coordinate 


WY = Window origin Y 
coordinate 


SW = Screen Width 




SH = Screen Height 





Figure 4-1 below shows a simple scenario of a screen with a single window 
which has a user defined scissor region. The shaded area shows the region 
where fragments pass the user and screen scissor tests and so can progress 
in the pipeline. Fragments outside this region are culled from the pipeline. This 
test applies to normal pixels and block fill operations. 
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Figure 4-10. Screen Scissor and User Scissor Tests 




This test may reject fragments when some part of a window has been moved 
off the screen. It will not reject fragments when part of a window is simply 
overlapped by another window. 

4.4.3 Area Stippling 

An 8 x 8 bit area stipple pattern can be applied to fragments. The least 
significant 3 bits of the fragment's (X,Y) coordinates, index into a 2D stipple 
pattern. If the selected bit in the pattern is set, then the fragment passes the 
test, otherwise it is rejected. In addition the bit pattern can be inverted or 
mirrored. Inverting the bit pattern has the effect of changing the sense of the 
accept/reject test. If the mirror bit is set the most significant bit of the pattern 
is towards the left of the window, the default is the converse. 

In some situations window relative stippling is required but coordinates are 
only available screen relative. To allow window relative stippling, an offset is 
available which is added to the coordinates before indexing the stipple table. 
X and Y offsets can be controlled independently. 

If the ForceBackgroundColor bit is set in the AreaStippleMode register, 
fragments which fail the area stipple test are not discarded. Instead, the 
contents of the TexelO register are used in place of the normal color for that 
pixel. 
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Area stippling is enabled using the AreaStippleMode register and must be 
qualified by the AreaStippleEnable bit in the Render command register. 
Stippling is only applied to normal pixels and has no effect on block fills. 

4.4.4 Registers 

The scissor operation is controlled by the ScissorMode register as shown in 
Figure 4-11 : 

Figure 4-11. Scissor Mode Register 

31 24 16 8 

— I — I — I — I — I — I — I — I — I — I — I — I — I — I — I — I — I — I — I — I — I — I — I — I — I — I — I — I — I — I — I — 

Reserved 

i i i i i i i i i i i i i i i i i i i i i i i i i i i i i I I 

Screen scissor enable 
User scissor enable 

Normally, the screen scissor test were always be enabled. The most common 
exception is during image upload. 

The user scissor region is specified by two registers ScissorMinXY and 
ScissorMaxXY the X values are stored in the least significant 16 bits of the 
register, the Y values in the most significant 16 bits of the register. 

The WindowOrigin register has the X coordinate of the origin stored in the least 
significant 16 bits of the register, and the Y coordinate in the most significant 
16 bits of the register. As each fragment is generated by the Rasterizer unit, 
this origin is added to the coordinates of the fragment to generate its screen 
coordinates. 

The ScreenSize register specifies the screen width and height, with the width 
in the least significant 1 6 bits and the height in the most significant 1 6 bits. 

The area stipple operation is controlled by the AreaStippleMode register as 
shown in Figure 4-12: 
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Figure 4-12. AreaStippleMode Register 

31 24 



16 



I I I I I I I I I I 
Reserved 




I I 

II 


I 


I I 
Y Offset 


I I I I I I I I I I I 
X Offset Reserved j 









i 


i i 


i I i i I i i i i i I 



ForceBackgroundColor Not used Not used Enable Unit 

MirrorY 
MirrorX 
Invert Stipple Pattern 

The EnableUnit bit is qualified by the AreaStippleEnable bits in the Render 
command register. The area stipple is set up in the AreaStipple Pattern n 
register, where n represents an integer between and 7. 



4.4.5 Scissor Example 



To enable screen scissor for a region: 10 <= X < 500, 100 <= Y < 200 with a 
screen size of 1 280x1 024 and the window origin at (1 00,1 00). 

// Set the screen size 
screenSize . Width = 1280 
screenSize . Height = 1024 
ScreenSize (screenSize) 
// Set the window origin 
windowOrigin . X = 100 
windowOrigin . Y = 100 
// Set-up the user scissor values 
minXY.X =10 
minXY.Y = 100 
maxXY.X = 500 
maxXY.Y = 200 

ScissorMinXY (minXY) // Load the registers 

ScissorMaxXY (maxXY) 
// Enable the unit 

scissorMode .UserScissorEnable = TVP4 010_ENABLE 
scissorMode . ScreenScissorEnable = TVP4 010_ENABLE 
ScissorMode (scissorMode) 
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1 1 Render primitives 

4.4.6 Area Stipple Example 

A repeating area stipple pattern of 2 x 2 pixels producing a 50% gray area: 

AreaStipplePatternO (OxAA) 
AreaStipplePatternl (0x55) 
AreaStipplePattern2 (OxAA) 
AreaStipplePattern3 (0x55) 
AreaStipplePattern4 (OxAA) 
AreaStipplePattern5 (0x55) 
AreaStipplePattern6 (OxAA) 
AreaStipplePattern7 (0x55) 
// Set-up mode register 

areaStippleMode.UnitEnable = TVP4 010_ENABLE 
areaStippleMode . XOf f set = 
areaStippleMode . YOf f set = 
areaStippleMode . Invert = 
areaStippleMode .MirrorY = 
areaStippleMode . MirrorX = 
// Load mode register 
AreaStippleMode (areaStippleMode) 

// When issuing a Render command, the AreaStippleEnable 
// bit should be set in addition to the area stipple test 
// being enabled: 

// render .AreaStippleEnable = TVP4010_TRUE 
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4.5 Localbuffer Read and Write Units 

The localbuffer holds the Stencil and Depth data associated with a fragment. 
Although separate units in the Hyperpipeline, the localbuffer read and write 
units are best considered as a pair. 

4.5.1 Localbuffer Read 

The LBReadMode register can be configured to make 0, 1 , or 2 reads of the 
localbuffer. The following are the most common modes of access to the local- 
buffer: 

□ Normal rendering without depth or stencil testing. This requires no local- 
buffer reads or writes. 

□ Normal rendering with depth and/or stencil testing required which 
conditionally requires the localbuffer to be updated. This requires 
localbuffer reads and writes to be enabled. 

□ Copy operations. Operations which copy all or part of the localbuffer. This 
requires reads and writes enabled. 

□ Upload/download operations. Operations which download depth or 
stencil information to the localbuffer, or read back depth or stencil values 
from the localbuffer to the host. 

The address calculation implements the following equations: 

Bottom left origin : 

Destination address = LBWindowBase - Y * W + X 

Source address = LBWindowBase - Y * W + X + LBSour- 
ceOf f set 

Top left origin : 

Destination address = LBWindowBase + Y * W + X 

Source address = LBWindowBase + Y * W + X + LBSour- 
ceOf f set 
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where: 

Destination address is the address any write will be made to and any destination read will 
be made from. 

is the address a source read will be made from, 
is the pixel's X coordinate, 
is the pixel's Y coordinate. 

holds the base address in the localbuffer of the current window. 

is normally zero except during a copy operation where data is read 
from one address and written to another address. The offset from 
destination to source is held in the LBSourceOffset register. 

is the screen width. Only a subset of widths are supported and these 
are encoded into the PPO, PP1 and PP2 fields in the LBReadMode 
register. See the table in Appendix C for more details. 

The localbuffer can be read in three formats: LBDefault, LBStencil or LBDepth. 
These tell the TVP401 which areas of the localbuffer are required. LBDefault 
is used for all copy and rendering operations, LBStencil and LBDepth are used 
for image upload of the Stencil and Depth planes. Table 4-5 summarizes the 
common rendering operations and the read modes required for them: 



Table 4-5. 


Localbuffer Read/Write Modes 






ReadSource 


ReadDestination 


Writes 


Data Type 


Rendering Operation 


Disabled 


Disabled 


Disabled 




Rendering with no Depth or Stencil en- 
abled. 


Disabled 


Disabled 


Enabled 


LBStencil 
LBDepth 


Download to localbuffer from host 


Disabled 


Enabled 


Disabled 


LBStencil 
LBDepth 


Upload from localbuffer to host 


Disabled 


Enabled 


Enabled 


LBDefault 


Rendering with depth and/or stencil up- 
dates enabled. 


Enabled 


Disabled 


Enabled 


LBDefault 


Localbuffer copy operations . 



Incorrect data can be read if reads are enabled but the same data had just 
been written with reads disabled. To avoid this problem, a WaitForCompletion 
command should be sent after enabling reads, but prior to the next primitive. 



4.5.2 Localbuffer Write 

Writes to the localbuffer must be enabled to allow any update of the localbuffer 
to take place. The LBWriteMode register is a single bit flag which controls 
updating of the buffer. 



Source address 

X 

Y 

LBWindowBase 
LBSourceOffset 

W 
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4.5.3 Localbuffer Data Formats 



The Depth field can be either 1 5 or 1 6 bits wide and the Stencil field either 1 
or bits wide. The total width of the localbuffer data should not be greater than 
1 6 bits. If a Stencil field is defined it occupies bit 1 5; the depth field always starts 
at bit 0. 

The LBReadFormat and LBWriteFormat registers must be configured to the 
appropriate values. The format can be different for different windows. 



4.5.4 Registers 

The LBReadMode register is as shown below in Figure 4-13: 
Figure 4-13. LBReadMode Register 



31 24 


16 


8 







I I I I I I I I I I I I 
Reserved 

I 


i i 

l i 


l l l l 

Reserved 

i i i i 


l 

I 


I I 
PP2 
i i 


I I 
PP1 

i i 


I I 
PPO 

I I 



Patch Enable 



Data Type 



Window origin 




Partial product selection 



Read Source enable 
Read Destination enable 



PatchEnable, when set, enables normal patch addressing of the localbuffer. 
This typically results in more efficient memory bandwidth utilization. 

The Partial Product fields PPO, PP1, and PP2 define the width of the 
localbuffer; they are described in Appendix C. 

ReadSourceEnable and ReadDestinationEnable control localbuffer reads of 
the destination address and source address respectively. DataType controls 
the format of localbuffer data, and WindowOrigin specifies if the window origin 
is Top Left or Bottom Left. 

The localbuffer format must be specified for both reads and writes using the 
LBReadFormat and LBWriteFormat registers see, Figure 4-14. Normally 
these registers are set to identical values. It may be useful to set them to 
different values when copying between two windows using different depth 
widths. 
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Figure 4-14. LBReadFormat / LBWriteFormat Register 



31 


24 




16 8 







I 


I I I I I I 

L L L L L L 


I I I I 
L L L I 


I I I I I I I I I I I I I I 
Reserved 

L L L 1. L L L L L L 1. L L L 


1 


1 

1. 



Stencil Width 

Write Enable 

LBWriteMode is a single bit register, see Figure 4-15. When the least 
significant bit is set, writes to the localbuffer are enabled. 

Figure 4-15. LBWriteMode Register 

31 24 16 8 



I 


1 1 1 1 1 1 


1 1 1 1 


1 1 1 1 1 1 1 1 1 1 
Reserved 


i i i i i i i 

1 


1 





1 1 1 1 








Write Enable 

LBSourceOffset holds a 24 bit 2s complement value used in copy operations. 

LBWindowBase updates the base address of the localbuffer. 

The relative positions of the depth and stencil fields within the localbuffer are 
fixed. If a Stencil field is defined, then it occupies bit 1 5. The depth field always 
commences at bit 0. 

4.5.5 Localbuffer Example 

The following is an example of a rendering operation with localbuffer read and 
write. The TVP401 is configured with a 1 6 bit localbuffer such that 1 5 bits are 
used for depth and 1 bit for stencil with a screen size of 800 x 600. 

IbReadFormat .DepthWidth =3 // 15 bit 

IbReadFormat . StencilWidth =3 // 1 bit 

LBReadFormat ( IbReadFormat ) // Load read format 

LBWriteFormat ( IbReadFormat ) // Write is same as 

read 

// Set the localbuffer write mode 
LBWriteMode (0x1) 

// Set the localbuffer read mode 
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II Partial products for 800 : 512 + 256 + 32 
IbReadMode.PPO =5 1/ 512 (<< 9) 
IbReadMode.PPI = 4 // 256 (<< 8) 
lbReadMode.PP2 = 1 // 32 (<< 5) 
IbReadMode.ReadSource = TVP4 010_DISABLE 
IbReadMode.ReadDestination = TVP4 010_ENABLE 
IbReadMode.DataType = TVP 4 01 0_LBDEFAULT 
IbReadMode . WindowOr igin = as appropriate 
IbReadMode.PatchMode = TVP4 010_DISABLE 
LBReadMode (IbReadMode) 
LBWriteMode (TVP4 010_DI SABLE) 

// Now ready to render with localbuffer read and write 
// suitable for stencil and depth buffering operations. 
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4.6 Stencil/Depth Test Unit 

The stencil test conditionally rejects fragments based on the outcome of a 
comparison between the value in the stencil buffer and a reference value. The 
stencil buffer is updated according to the current stencil update mode which 
depends on the result of the stencil test and the depth test. Stencil testing can 
be used in many different ways, e.g., hidden line removal, decals, masking 
areas of the screen, and stippling. 

The depth (Z) test, if enabled, compares the fragment depth against the 
corresponding depth in the depth buffer. If the test fails, the fragment is 
rejected. 

4.6.1 Stencil Test 

This test only occurs when all the preceding tests (bitmask, scissor, stipple) 
have passed. The stencil test is controlled by the stencil function and the 
stencil operation. The stencil function controls the test between the reference 
stencil value and the value held in the stencil buffer. If the test is LESS and the 
result is true, then the fragment value is less than the source value. The stencil 
operation controls the updating of the stencil buffer and is dependent on the 
result of the stencil and depth tests. 

Table 4-6 shows the stencil functions available: 
Table 4-6. Stencil Comparison Modes 



Mode Comparison Function 






Never 


1 


Less 


2 


Equal 


3 


Less or Equal 


4 


Greater 


5 


Not Equal 


6 


Greater or Equal 


7 


Always 



If the stencil test is enabled, then the stencil buffer is updated depending on 
the outcome of both the stencil and the depth tests (if the depth test is disabled 
the depth result is set to pass). See Tables 4-7 through 4-9 and the definition 
of the StencilMode register in subsection 4.6.3 to fully understand their 
relationship. 
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Table 4-7. Possible Update Operations for Stencil Planes 

Stencil Test 
Pass Fail 
Depth Test Pass dppass sfail 

Fail dpfail sfail 

The entries dppass, dpfail and sfail are set to one of the update operations 
shown in Table 4-8, source stencil is the value in the stencil buffer: 

Table 4-8. Stencil Operations 



Update Method 


Mode 


Stencil Value 


Keep 





Source stencil 


Zero 


1 





Replace 


2 


Reference stencil 


Increment 


3 


Clamp (Source stencil + 1 ) to 2stencil width _ 1 


Decrement 


4 


Clamp (Source stencil -1) to 




5 


-Source stencil 



In addition, a comparison bit mask is supplied in the StencilData register. This 
is used to establish which bits of the source and reference value are used in 
the stencil function test. 

The source stencil value can be from a number of places as controlled by a 
field in the StencilMode register: 

Table 4-9. Stencil Sources 

LBWriteData Stencil Use 



Test logic 
Stencil register 



LBSourceData: (stencil val- 
ue read from the localbuffer) 

Source stencil value read 
from the localbuffer 



This is the normal mode. 

This is used, for instance, in the OpenGL draw pixels function where the 
host supplies the stencil values in the Stencil register. 
It is used when a constant stencil value is needed, for example when 
clearing the stencil buffer . 

This is used, for instance, in the OpenGL copy pixels function when the 
stencil planes are to be copied to the destination. The source is offset from 
the destination by the value in LBSourceOffset register. 

This is used, for instance, in the OpenGL copy pixels function when the 
stencil planes in the destination are not to be updated. The stencil data will 
come from the localbuffer data. 



See The OpenGL Reference Manual and The OpenGL Programming Guide 
from Addison-Wesley for more details of the stencil operations and examples 
of its use. 
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4.6.2 Depth Test 



This test is only performed if all the preceding tests (bitmask, scissor, stipple) 
have passed. The comparison tests available are shown in Table 4-10: 



Table 4-10. Depth Comparison Modes 



Mode Comparison Function 






Never 


1 


Less 


2 


Equal 


3 


Less Than or Equal 


4 


Greater 


5 


Not Equal 


6 


Greater Than or Equal 


7 


Always 



The test compares the fragment depth against a source depth value. If the 
compare function is LESS and the result is true then the fragment value is less 
than the source value. The source value can be obtained from a number of 
places as controlled by a field in the DepthMode register. 



Table 4-11. Depth Sources. 



Source 



Use 



DDA (see below) 
Depth register 



LBSourceData: Source 
depth value from the 
localbuffer 

Source Depth 



This is used for normal Depth buffered 3D rendering. 

This is used, for instance, in the OpenGL draw pixels function where the 
host supplies the depth values through the Depth register. 
Alternatively this is used when a constant depth value is needed, for 
example, when clearing the depth buffer or 2D rendering where the depth is 
held constant. 

This is used, for instance, in the OpenGL copy pixels function when the 
depth planes are to be copied to the destination. 

This is used, for instance, in the OpenGL copy pixels function when the 
depth planes in the destination are not updated. The depth data will come 
from the localbuffer. 



For a depth buffered trapezoid, the TVP401 interpolates from the dominant 
edge of a trapezoid to the subordinate edges. This means that two increment 
values are required, one to move along the dominant edge and one to move 
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across the span to the subordinate edge. This is illustrated in Figure 4-1 6. The 
rendering direction chosen here is bottom to top. 

ZStart = Start Z value 

dZdyDom = Increment along dominant edge. 
dZdx = Increment along the scan line. 

The dZdx value is not required for Z-buffered lines. 
Figure 4-16. Depth Interpolation 




Subordinate Edges 



The number format for the increment values is 2s complement fixed point 
integer: 16 bits integer and 11 bits fraction. All the start, derivative and internal 
data is in this format. This is mapped into the Upper and Lower registers (U 
and L) as shown in Figure 4-17. 

Figure 4-17. Depth Derivative Format 



sign bit 



not used r 16 bits integer 


11 bits fraction 


remaining bits 


U 




L 



This data format is compatible with GLINT 300SX and GLINT 500TX graphics 
processors. In many instances, the fractional part can be left containing zero, 
avoiding the need to continually update ZStartL, dZdxL and dZdyDomL. 

The Depth unit must be enabled to update the depth buffer. If it is disabled then 
the depth buffer will only be updated if ForceLBUpdate is set in the Window 
register. If no updates of the localbuffer are required, setting DisableLBUpdate 
in the Window register may improve performance. 
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4.6.3 Registers 



Stencil test is controlled by the StencilMode register as shown in Figure 4-1 8: 



Figure 4-18. StencilMode Register 



31 



24 



16 



~~\ — i — i — i — i — i — i — i — i — i — i — i — i — i — r 
Reserved 

J I I I I I I I I I I I I I L 



src 



i — i 
func 

I L_ 



sfail 



— i — i — 
dpfail 

_J L_ 



dppass 



Unsigned compare function 



Unit enable 



Stencil source 



Update Method 



The StencilData register holds the other data associated with the test, see 
Figure 4-19. 



Figure 4-19. StencilData Register 

31 24 

1 — i — i — i — i — i — i — i — i — i — i — i — 

Reserved 

j i i i i i i i i i i i 



16 



-i — i — i — i — r 
Reserved 

_l I I I L 



~l 1 1 1 1 1 

Reserved 

J I I I I L_ 



Write Mask Compare Mask Reference Stencil 

The stencil writemask is used to control which stencil planes are updated as 
a result of the test. The Stencil register holds an externally sourced stencil 
value. It is a 32-bit register of which only the least significant bit is used. The 
unused bits should be set to zero. 

The Stencil unit must be enabled to update the stencil buffer. If it is disabled 
then the stencil buffer will only be updated if ForceLBUpdate is set in the 
Window register. 

Operation of the Depth unit is controlled by the DepthMode register as shown 
in Figure 4-20: 

Figure 4-20. DepthMode Register 
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24 


16 


8 





i i i i i i i i i i i i i i i i i i i i i i i i 

Reserved 

i i i 


i i 


i 
i 







Compare Mode New Depth Source 

Write Mask 



Unit enable 



4-46 



Stencil/Depth Test Unit 



The single bit writemask is used to control updating all the bits in the depth 
buffer. 

The Depth register holds an externally sourced 1 6 bit depth value. If the depth 
buffer holds 15bits then the user supplied depth value is right justified to the 
least significant end of the register. The unused most significant bit should be 
set to zero. 

The DDA and other registers are shown in Table 4-1 2 (note that increment val- 
ues are split into two registers): 

Table 4-12. Depth Interpolation Registers 



Register 


Description 


ZStartU 


Depth start value 


ZStartL 




dZdxU 


Depth derivative per unit X 


dZdxL 




dZdyDomU 


Depth derivative per unit Y, dominant edge, or along a line. 


dZdyDomL 





The Window register, (see Figure 4-21) is used to control the update of the 
localbuffer. 

Figure 4-21. Window Register 



31 


24 


16 


8 





i i i i i i i i i i i i i 

Reserved 
i 




i i i i i i i i i i i 

Reserved 
i iii 






i 
i 



LB Update Source 
Disable LB Update Force LB Update 



Reserved 

4.6.4 Stencil Example 

This stencil example sets the Stencil unit to use a supplied reference value 
(0x1 ) and to test fragments to be LESS than this value. It also sets the stencil 
planes update function to Decrement when the test passes and the depth test 
passes (or is not enabled), otherwise, it sets the update function to Keep. 
Because Decrement is the selected mode, this example does not require that 
the Stencil register be loaded. 
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1 1 Set the localbuffer read and write modes 
// See section 4.5 
// Set the stencil modes 
stencilMode.UnitEnable = TVP4 010_ENABLE 

stencilMode.DPPass = TVP4 1 0_STENCIL_METHOD_DECREMENT 
stencilMode.DPFail = TVP4010_STENCIL_METHOD_KEEP 
stencilMode . SFail = TVP4010_STENCIL_METHOD_KEEP 
stencilMode.CompareFunction = TVP4010_STENCIL_COMPARE_LESS 
stencilMode . StencilSource = TVP4010_SOURCE_TEST_LOGIC 
StencilMode (stencilMode) 

// Set the reference stencil value and set the 
/ / compare and writemasks to 0x1 
stencilData . Ref erenceStencil = 0x1 
stencilData . CompareMask = 0x1 
stencilData . StencilWriteMask = 0x1 
StencilData (stencilData) 

// Enable the depth test here if required, if not enabled 
// the result of the depth test is set to pass. 

4.6.5 Depth Example 

This depth example does the required set up for drawing a depth buffered 
primitive. 

// Set the localbuffer read and write modes 
// See section 4.5 

depthMode.UnitEnable = TVP4 010_ENABLE 
depthMode . WriteMask = 1 

depthMode . NewDepthSource = TVP4 010_NEW_DEPTH_SOURCE_DDA 
depthMode . CompareMode = TVP4010_DEPTH_COMPARE_MODE_LESS 
DepthMode (depthMode) 

// Load the depth start values and deltas for the dominant 
edge 

// and the body of the trapezoid 

ZStartUO // Load upper and lower start values 
ZStartL () 

dZdxUO // Load upper and lower dZdx deltas 

dZdxL () 

dZdyDomU ( ) // Load upper and lower dominant edge deltas 

dZdyDomL ( ) 

// Render primitive 
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4.7 Texture Address Unit 

The Texture Address unit calculates the address of the texel that maps to the 
current fragment XY position. Perspective correction can be applied as part 
of the operation. 

The texture coordinates are referred to as S and T where S is analogous to X 
and T to Y. The S and T values are generated by interpolation; a third 
component, Q, may also be interpolated and is used in perspective correction. 

4.7.1 Texture Interpolation 

The DDA units perform linear interpolation given a set of start and increment 
values. 

The TVP401 interpolates from the dominant edge of a trapezoid to the subor- 
dinate edges. This means that two increment values are required per texture 
component, one to move along the dominant edge and one to move across 
the span to the subordinate edge. This is illustrated, for the S component, in 
the diagram of Figure 4-22: 

Figure 4-22. Texture Address Interpolation 




Subordinate Edges 



SStart = Initial S value 

dSdyDom = S gradient in the Y direction along the domi- 
nant edge 

dSdx = S gradient in the X direction 

The calculation for the delta values is the same as other parameters such as 
depth values (see subsection 4.2.5). 

When perspective correction is not enabled, then the S and T values are the 
texture coordinates of the appropriate vertex. When perspective correction is 
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4.7.2 Registers 



enabled the texture coordinates are divided by the homogenous coordinate W, 
and Q is formed from 1/W. S and T are then normalized with respect to Q so 
that Q lies in the range 1 to 1/127. These values are then used to calculate 
delta values in the same way as for color or depth. If the dynamic range of Q 
is such that it cannot be normalized to the supported range, the software 
should either tessellate the triangle into smaller regions to reduce the range 
or accept a reduction in accuracy; a Q value of zero will be handled in a 
reasonable manner. 

If perspective correction is enabled, each interpolated S and T value is divided 
by the interpolated Q value. If fast perspective correction is enabled, the a 
faster but less accurate division is used. The result is passed to the Texture 
Read unit which reads the texel from memory. 

If subpixel correction has been enabled for a primitive, then any correction 
required will be applied to the texture coordinates. 



The S and T values are in 30-bit 2s complement format, see Figure 4-23. 



Figure 4-23. Fixed Point S and T Format 
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The Q values are in 29-bit 2s complement format, see Figure 4-24. 



Figure 4-24. Fixed Point Q Format 
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The registers to set up Texture interpolation are shown in Table 4-13. 
Table 4-13. Texture Interpolation Registers 



Register 


Data Field 


Description 


SStart 


30 bit 2s comp fix pt 


S start value 


dSdx 


30 bit 2s comp fix pt 


S derivative per unit X 


dSdyDom 


30 bit 2s comp fix pt 


S derivative per unit Y, dominant edge 


TStart 


30 bit 2s comp fix pt 


T start value 


dTdx 


30 bit 2s comp fix pt 


T derivative per unit X 


dTdyDom 


30 bit 2s comp fix pt 


T derivative per unit Y, dominant edge 


QStart 


29 bit 2s comp fix pt 


Q start value 


dQdx 


29 bit 2s comp fix pt 


Q derivative per unit X 


dQdyDom 


29 bit 2s comp fix pt 


Q derivative per unit Y, dominant edge 



To enable accurate perspective correction, the Perspective Correction bit in 
the TextureAddressMode register must be set, see Figure 4-25. To enable, 
fast, though less accurate, perspective correction, both the Fast and 
Perspective Correction bits must be set. For many applications, fast 
perspective correction provides more than adequate results. Note that the 
Texture Enable bit in the Render command must also be set for texture 
mapping. 

Figure 4-25. TextureAddressMode 
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Delta Format 

Fast 

Perspective Correction 

Enable umi 

When the TVP401 is being used in conjunction with a GLINT Delta processor, 
the Delta Format bit in the TextureAddressMode register should be set. This 
allows the TVP4010 to make use of the texture delta values e.g. dSdx, that 
are generated by the GLINT Delta. When the TVP401 is configured without 
a GLINT Delta processor, this bit should be reset. 

4.7.3 Texture Interpolation Example 

This texture interpolation example sets up the parameters for 2D texture 
mapping. 1 D texture mapping can be achieved by setting TStart, dTdx and 
dTdyDom to zero. 
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I / Load the start values and deltas for the dominant edge 

// and the body of the trapezoid 

SStartO // Load S start value 

TStartO // Load T start value 

QStartO // Load Q start value 

dSdx() // Load S delta for X 

dTdx() // Load T delta for X 

dQdx() // Load Q delta for X 

dSdyDom() // Load S dominant edge delta 

dTdyDom() // Load T dominant edge delta 

dQdyDom() // Load Q dominant edge delta 

// Render primitive 
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4.8 Texture Read Unit 

The texture buffer holds texture data. The buffer shares the same memory as 
the localbuffer and framebuffer; texture maps are normally written to memory 
through the framebuffer write unit in a similar manner to image download. 

The Texture Read unit receives texture addresses from the Texture Address 
unit and reads data from memory. If bilinear filtering is enabled, several 
accesses may be done to collect the correct number of texels. 

4.8.1 Read Unit 

The address calculation implements the following equations: 
Bottom left origin: 

Address = TextureBaseAddress - T* W + S 
Top left origin: 

Address = TextureBaseAddress + T * W + S 
where: 

Address is the address any read will be made from. 

S is the texel's S coordinate. 

T is the texel's T coordinate. 

TextureBaseAddress holds the base address of the current texture. 

W is the texture width. Only a subset of widths are supported and these 

are encoded into the PPO, PP1 and PP2 fields in the TextureRead- 
Mode register. See Appendix C for more details. 

The TextureMapFormat register specifies how the texture map is held in 
memory. This includes the width of the texture map using partial product codes 
and the size of the texel. The TextureReadMode register specifies how the 
texture map should be handled internally. This sets the width (maximum S) 
and height (maximum T) that should be used when accessing the texture. 
There are three ways that the address can be modified when it exceeds either 
the width or height (or goes negative) as follows: 

Clamp clamps the coordinate to or the maximum value. 

Repeat accesses the map modulo the width or height. This results in the 

texture map being repeated. 

Mirror accesses the map modulo the width or height and mirror alternate 

texture maps. 

The width used to repeat or clamp can be different to the width used to set the 
stride of the texture in memory. This allows a texture to be selected from part 
of a larger image. 
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4.8.2 Texture Filtering 

A bilinear filter is available which combines the values of the four texels 
surrounding the index into the texture map to produce a single value. This filter 
will reduce pixelation effects when textures are enlarged and reduce aliasing 
effects when textures are shrunk. 

4.8.3 Texture Formatting 

The texture map can be held in memory in a variety of formats that correspond 
to the formats supported by the framebuffer. Two additional formats are 
provided to allow texture maps to be stored in YUV color format. When a texel 
is read into the TVP401 0, it is converted to the internal color format. External 
color formats are shown in Table 4-1 . 

NOTE: 

The color format value is made up of the 4 bits of the TextureFormat field and 
the 1 bit TextureFormatExtension field in the TextureDataFormat register. 

If the selected format has no alpha buffer, a default value of 0xF8, which is the 
maximum is used. If the NoAlphaBuffer bit is set in the TextureDataFormat 
register, then 0xF8 is used even if the format has an alpha buffer. 

If the texture is in Color Index mode (either 4 or 8 bits) the single value is 
repeated for all color components. If the framebuffer format is also Color Index, 
the single value is used as the pixel color; if the framebuffer is RGBA, then the 
texture value becomes grey scale. If a 4-bit CI texel is written to an 8 bit CI 
framebuffer, the value is written to the upper four bits of the framebuffer byte. 

The texture values can be indexed through an 8-bit lookup table inside the 
TVP4010. This LUT holds 16 RGB values. If the 4-bit CI mode is used, one 
RGB color can be mapped to each texture value. If the 8-bit CI mode is used, 
the index is taken from the upper four bits of the texel value. 

4.8.4 Registers 

The TextureReadMode register controls the way that textures are read from 
memory. 

The S and T wrap modes can be set to clamp, repeat or mirror as described 
earlier. 

With Filter Mode disabled, nearest-neighbor texture mapping will be 
performed. With this bit set, bilinear filtering is enabled. 

The Packed Data bit is used to define how texels are read from memory. If this 
bit is cleared, each texel is read one at a time; if set, several texels can be read 
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simultaneously improving efficiency. The actual number of texels read in this 
case is dependant on the texel size. See subsection 4.10.4 for how this can 
be used for packed copies. 

The TextureReadMode register (see Figure 4-26) controls the way that 
textures are read from memory. With Filter Mode disabled, nearest-neighbor 
texture mapping is performed. With it set, bilinear filtering is enabled. 

Figure 4-26. TextureReadMode Register 
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The TextureMapFormat register (see Figure 4-27) specifies the way that the 
texture map is held in memory. The partial product codes are detailed in 
Appendix C. The window origin specifies the origin as being top left or bottom 
left. SubPatchMode when enabled, improves the performance of typical 
texture mapping. 

Figure 4-27. TextureMapFormat Register 
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The TextureDataFormat register (see Figure 4-28) specifies the color format 
of the texture. The TextureFormat combined with the TextureFormat Extension 
contain one of the modes described in Table 3-1 of Chapter 3. The color order 
specifies whether the texture is in RGB or BGR color format. 
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Figure 4-28. TextureDataFormat Register 
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The TexelLUTO to 15 registers contain the texture color look-up table. Each 
register contains 5 bit fields for red, green and blue color components. The 
TexelLUTMode register (see Figure 4-29) allows use of the TexelLUTO to 15 
registers. When enabled, the texel value becomes an index into this look-up 
table. 

Figure 4-29. TexelLUTMode Register 
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4.8.5 Texture Download Example 

The following is an example of texture downloading: 

fbReadMode.PatchMode = TVP4010_TRUE 
fbReadMode . SubPatchMode = TVP4010_SUBPATCH 
FBReadMode (fbReadMode) ; 
fbWriteMode. Enable = TVP4 010_TRUE 
FBWriteMode (fbWriteMode) 
// Set format to 8 bits 
ditherMode.UnitEnable = TVP4010_TRUE 
ditherMode. Enable = TVP4010_FALSE 

ditherMode . ColorMode = TVP4 010_COLOR_FORMAT_RGB_332 
DitherMode (ditherMode) 
/ / Do image download 
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4.8.6 Texture Mapping Example 

The pseudecode to texture map a trapezoid follows: 

textureAddressMode . Enable = TVP4010_TRUE 
textureAddressMode . PerspectiveCorrection = TVP4010_TRUE 
TextureAddressMode (textureAddressMode ) 
// Load texture address parameters 

SStart() 

dSdx () 
dSdyDom() 

TStartO 

dTdx ( ) 
dTdyDom ( ) 

QStart() 

dQdx ( ) 
dQdyDom() 

// Configure texture read 
textureReadMode. Enable = TVP4 010_TRUE 

textureReadMode . SWrapMode = TVP4 010_TEXTURE_WRAP_REPEAT 
textureReadMode . TWrapMode = TVP4 010_TEXTURE_WRAP_REPEAT 
textureReadMode . Width = width 
textureReadMode . Height = height 
textureReadMode. FilterMode = TVP4010_FALSE 
TextureReadMode (textureReadMode ) 
textureMapFormat . PPO = partialProductO 
textureMapFormat . PP1 = partialProductl 
textureMapFormat . PP2 = partialProduct2 
textureMapFormat . SubPatchMode = TVP4010_TRUE 
textureMapFormat . TexelSize = TVP4010_8_BITS_PER_TEXEL 
TextureMapFormat (textureMapFormat ) 

textureDataFormat . TextureFormat = TVP4 010_COLOR_FOR- 
MAT_RGB_3 3 2 

TextureDataFormat (textureDataFormat ) 

// Enable texture/fog/blend unit, load other parameters 
// and render 
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4.9 YUV Unit 

The YUV unit converts the YUV color format, also known as YCbCr, to RGB 
format. It also does chroma-key testing. Chroma-key testing may be done 
either before or after the conversion. 

The YUV conversion is done on data that is being loaded into the TexelO 
register. The data for this may come from the TextureRead unit or from the 
host, so YUV conversion can be done either during texture download or on a 
texture as it is applied to a primitive. The YUV data can be in either 444 format 
or 422 format. The chroma test may be done with either YUV or RGB data. 

4.9.1 Chroma Test 

The chroma test specifies upper and lower bounds against which the TexelO 
value is tested. The test may be set to pass when the components of TexelO 
are either all inside or all outside the bounds. This is controlled by the 
accept/reject TestMode options of the YUVMode register. If the test passes, 
the TexelO data may be used in the Texture/Fog/Blend unit as normal. If the 
test fails, then the fragment to which the texture data maps, may be rejected 
(not plotted). This is useful for cut-outs and sprites. Alternatively, on test 
failure, the TexelO value may be rejected and the texture operation on the 
fragment suppressed. This is achieved by setting the RejectTexel bit in the 
YUVMode register. In this case the underlying color provided by the TVP401 
is used without being modified by the texture color. This is useful for applying 
a logo to a shaded polygon where the underlying color is provided by the Color 
DDA unit. 

The test modes available are shown in Table 4-14. 
Table 4-14. Chroma Test Modes 



Mode 


Test Mode 





No test 


1 


Accept 


2 


Reject 



Chroma-key testing can be done without involving texture mapping. This is 
achieved by setting the TexelDisableUpdate field in the YUVMode register 
(see Figure 4-30). This allows fragments to be rejected by chroma testing as 
part of a copy operation. If chroma testing is required against the destination 
color of a copy (i.e., only overwrite pixels of the specified color), then the 
destination region of the screen is used as the texture map and the framebuffer 
units are set-up to do a normal copy. The texels are read in and tested, and 
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fragments rejected if the colors do not match. If the fragment has been 
rejected, then the copy for that pixel will not take place. Setting the 
TexelDisableUpdate bit discards the texel as soon as the test has been done 
which improves performance. 

Figure 4-30. YUV Mode Register 
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The TestData bit controls when the chroma test occurs in relation to the color 
conversion. Setting this bit causes the chroma test to occur on the output of 
the unit; clearing it causes the chroma test to occur on the input i.e., after or 
before color conversion respectively, assuming the Enable bit is set. 

The TestMode can be set as follows: (see Figure 4-31) 

□ Accept, i.e., pass test if (upper bound <= color >= lower bound 

□ Reject, i.e., fail test if (upper bound <= color >= lower bound 

Figure 4-31. ChromaUpperBound and ChromaLowerBound Registers 
RGB format 
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4.10 Framebuffer Read and Write Units 

Before drawing can take place, the TVP4010 must be configured to perform 
the correct framebuffer read and write operations. Framebuffer read modes 
affect the operation of alpha blending, logic ops, software writemasks, image 
upload and image copy operations. Framebuffer write modes are relevant to 
all drawing in the framebuffer. 

4.10.1 Framebuffer Read 

The FBReadMode register allows the TVP4010 to be configured to make 0, 
1 or 2 reads of the framebuffer. The following are the most common modes of 
access to the framebuffer: 

□ Rendering operations with no logical operations, software writemasking 
or alpha blending. In this case no read of the framebuffer is required and 
framebuffer writes should be enabled. Framebuffer reads should be 
disabled for maximum efficiency. 

□ Rendering operations which use logical ops, software writemasks or alpha 
blending. In these cases the destination pixel must be read from the frame- 
buffer and framebuffer writes must be enabled. 

□ Image copy operations. Here set up depends on whether logical ops, 
software writemasks and/or alpha blending are occurring with the copy. If 
any of these are, the framebuffer needs two reads, one for the source and 
one for the destination. Otherwise, only one read is required. 

□ Image upload. This requires reading of the destination framebuffer pixels 
to be enabled and framebuffer writes to be disabled. 

□ Image download. This case requires no framebuffer reads (as long as soft- 
ware writemasking, alpha blending and logic ops are disabled) and the 
write must be enabled. 

Note that avoiding unnecessary additional reads will enhance performance. 

i 1 

Note: 

The OpenGL specification, allows any combination of the Front, Back, Left 
and Right color buffers to be updated simultaneously. In this case a scene 
would be rendered multiple times changing the FBPixelOffset as 
appropriate. When using this mode it is important to ensure that the buffers 
that affect the rendering are updated only once. For example, when 
rendering with depth buffering enabled, localbuffer writes should only be 
enabled for the last buffer updated. 
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For both the read and the write operations, an offset is added to the calculated 
address. The source offset (FBSourceOffset) is used for copy operations. The 
pixel offset (FBPixelOffset) can be used to allow multi-buffer updates. The 
offsets should be set to zero for normal rendering. The address calculation 
implements the following equations: 

Note: 1) 

Bottom left origin: 

Destination address = FBWindowBase - Y * W + X + FBPix- 
elOffset 

Source address = FBWindowBase - Y * W + X + FBPixelOff- 
set + FBSourceOffset 

Top left origin: 

Destination address = FBWindowBase + Y * W + X + FBPix- 
elOffset 

Source address = FBWindowBase + Y * W + X + FBPixelOff- 
set + FBSourceOffset 

where: 

Destination Address is the address in the framebuffer which is written to if writes are 
enabled, and is also the address read when Read Destination is 
enabled. 

is the address in the framebuffer which is read from when Read- 
Source is enabled. 

is the pixel's X coordinate, 

is the pixel's Y coordinate, 

holds the base address in the framebuffer of the current window. 

is normally zero except when multi-buffer writes are needed 
when it gives a way to access pixels in alternative buffers 
without changing the FBWindowBase register. This is useful as 
the window system may be asynchronously changing the 
window's position on the screen. It is held in the FBPixelOffset 
register. 

is normally zero except during a copy operation where data is 
read from one address and written to another address. The 
FBSourceOffset is held in the FBSourceOffset register and is the 
offset from destination to source. 

is the screen width. Only a subset of widths are supported and 
these are encoded into the PPO, PP1 and PP2 fields in the 
FBReadMode register. See Appendix C for more details. 

The data read from the framebuffer may be either FBDefault (data which may 
be written back into the framebuffer or used in some manner to modify the 
fragment color) or FBColor (data which will be uploaded to the host). Table 



Source Address 

X 
Y 

FBWindowBase 
FBPixelOffset 



FBSourceOffset 



W 
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4-15 summarizes the framebuffer read/write control for common rendering 
operations. 



Table 4-15. Framebuffer Read/Write Modes 



ReadSource 


ReadDestination 


Writes 


Read Data 
Type 


Rendering Operation 


Disabled 


Disabled 


Enabled 




Rendering with no logical operations, 
software writemasks or alpha blending. 


Disabled 


Disabled 


Enabled 




Image download. 


Disabled 


Enabled 


Disabled 


FBColor 


Image upload. 


Enabled 


Disabled 


Enabled 


FBDefault 


Image copy with hardware writemasks. 


Disabled 


Enabled 


Enabled 


FBDefault 


Rendering using destination-only 
logical operations, software writemasks 
or alpha blending. 


Enabled 


Enabled 


Enabled 


FBDefault 


Image copy with logical operations, 
software writemasks or alpha blending. 



Incorrect data can be read if reads are enabled but the same data has just been 
written with reads disabled. To avoid this problem, a WaitForCompletion 
command should be sent after enabling reads, but prior to the next primitive. 



4.10.2 Framebuffer Write 

Framebuffer writes must be enabled to allow the framebuffer to be updated. 
A single 1-bit flag controls this operation. 

The Framebuffer Write unit is also used to control the operation of fast block 
fills, if supported by the framebuffer. Fast fill rendering is enabled via the Fast- 
FillEnable bit in the Render command register. The FBBIockColor register 
holds the data written to the framebuffer during a block fill operation and should 
be formatted to the 'raw' framebuffer format. When using the framebuffer in 8 
bit packed mode, the data should be repeated in each byte. When using the 
framebuffer in packed 1 6 bit mode, the data should be repeated in the top 1 6 
bits. 

When uploading images the UpLoadData bit can be set to allow color 
formatting. See subsection 4.12.6 for more details. 

4.10.3 Patching 

Data in the framebuffer can use patched addressing to improve performance 
under certain circumstances. However, only non-visible data is normally 
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patched. Patch mode organizes data for efficient drawing of scanline 
primitives; it also helps line drawing. This form is typically used in the 
localbuffer, see subsection 4.5.4, for patching the depth buffer. The SubPatch 
mode re-organizes data for efficient texture operations; see section 4.8.4. 
SubPatchPack mode is used when 4-bit textures are loaded as 8 bits i.e., the 
subpatch packing takes into account the two texels per byte. 

4.10.4 Packed Copies 

Packed copies move 32 bits at a time even though the real pixel size may be 
8 or 16 bits. The PackedDataLimits register holds the left and right X 
coordinates for the destination area of the screen in the native pixel format. 
Any pixels outside this area are not plotted. The relative offset field in the 
FBReadMode register specifies the number of pixels that the source data has 
to be adjusted to align with the destination data. 

4.10.5 Image Downloads 

An image download can be performed in one of four ways. It can be achieved 
by loading the data in standard color format into the Color register and using 
the Color Format unit to organize it into the framestore format. Or it can be 
achieved by loading the data in raw framebuffer format either into the Color 
register or the FBData register. The former requires that the Color Format unit 
is disabled whilst the latter ignores this unit. Alternatively, the data can be 
loaded as some other raw format into the FBSourceData register and have the 
Texture/Fog/Blend unit convert it into the internal color format. The Color 
Format unit can then convert it into the arrangement to be stored in the 
framebuffer. Both techniques require setting up the Rasterizer appropriately. 

4.10.6 Fast Texture Download 

Normal texture download is done as an image download. This involves setting 
up the Rasterizer to draw a rectangle and changing the state of a number of 
units. This is a good way to load the texture if any processing needs to be done, 
such as color format conversion, color space conversion or patching. 

If the texture is held on the host in the raw framebuffer format, the fast texture 
download approach can be used. The Texture Down I oadOff set register holds 
the base address of the framebuffer using 32-bit pixel addressing. The 
TextureData register holds the texture data in raw framebuffer format 32 bits 
at a time. The load of this register is ignored by all other units in the pipeline 
so no state needs to be saved and restored. Following the receipt of each 
TextureData value, the TextureDownloadOffset value is incremented. If this 
register is read, it returns the current count, not the original value. 
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If fast download is used, the texture map on the host must be in the format it 
will be stored in memory, including any color formatting, byte swapping, or 
address patching. If a texture will be loaded several times, it can be 
downloaded as an image the first time using all formatting controls, and then 
uploaded again as a raw image for later use. 

Using this technique, framebuffer writes do not need to be enabled. 

4.10.7 Hardware Writemasks 

Hardware writemasks, if available, are controlled using the FBHardwareWrite- 
Mask register. If the framebuffer memory devices support hardware write- 
masks and they are to be used, then software writemasking should be disabled 
(by setting all the bits in the FBSoftwareWriteMask register). See subsection 
4.1 5.3. This will result in fewer framebuffer reads when no logical operations 
or alpha blending is needed. 

If the framebuffer is used in 8-bit packed mode, then an 8-bit hardware write- 
mask must be repeated in all four bytes of the FBHardwareWriteMask register. 
If the framebuffer is in 1 6-bit packed mode then the 1 6-bit hardware writemask 
must be repeated in both halves of the FBHardwareWriteMask register. 

As there is no overall enable for this feature, the hardware writemask MUST 
be set to all ones, except when hardware writemasking is explicitly required. 

4.10.8 Frame Blank Synchronization 

The SuspendUntilFrameBlank command register may be used to stall the 
TVP4010 pipeline until the next frameblank. For double buffering, it is 
beneficial to synchronize to the monitor blanking. By using this register, full 
screen double buffering can be controlled through the pipeline and the host 
does not need to wait for vertical frame blank itself. Instead, once the 
SuspendUntilFrameBlank command register has been loaded, the host can 
continue to load the TVP401 registers and issue commands. The TVP401 
continues to process these as long as they do not involve writing to the 
framebuffer. The data field of this register is the base address of the buffer to 
be displayed and is passed to the Internal Video Timing generator. 
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4.10.9 Registers 

The FBReadMode register layout is shown in Figure 4-32: 
Figure 4-32. FBReadMode Register 
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Patch Mode 
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Partial Production selection 



Packed data 

Patch enable \ Window origin \ Read Source enable 
Reserved Read Destination enable 




See Appendix C for more information on setting partial product codes. 

The FBWindowBase register holds the base address of the window in the 
framebuffer in 24-bit unsigned format. The FBPixelOffset and FBSourceOffset 
registers hold 24-bit 2s complement offsets used in copy operations and 
multi-buffer updates, as described above. 

The FBWriteMode controls the framebuffer write operations as shown in 
Figure 4-33: 



Figure 4-33. FBWriteMode Register 
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The FBReadPixel sets the pixel size, Figure 4-34. 
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Figure 4-34. FBReadPixel Register 
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The PackedDataLimits register is used to control packed copies see Figure 
4-35. 



Figure 4-35. PackedDataLimits Register 
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FBHardwareWriteMask is a 32-bit register where each bit acts as a mask. 
FBColor is a read-only register which returns the data to the host during image 
upload operations. 
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4.10.10 Image Copy Example 

This image copy example copies a rectangular region of the framebuffer, 
without moving any data in the localbuffer. The region extends from the origin 
(0,0) to (100,100) and will be shifted right by 200 pixels. The destination 
rectangle is scan converted. 

// First set-up the framebuffer read mode 
fbReadMode .ReadSource = TVP4 010_ENABLE 
fbReadMode . ReadDestination = TVP4 010_DI SABLE 
fbReadMode .DataType = TVP 4 01 0_FBDEFAULT 
FBReadMode ( fbReadMode) // Update register 

// Now enable framebuffer write 
fbWriteMode . WriteEnable = TVP4 010_ENABLE 
FBWriteMode (fbWriteMode) // Update register 

// Offsets. No Pixel offset, source offset of 200 
FBPixelOf f set (0x0) 
FBSourceOf f set (-200) 

// All the tests which could remove the fragment must 
//be disabled (Stipple, Stencil, Depth) except 
// the Scissor test which is still needed for screen 
// and possibly window clipping. 

// If software writemasks are to be used then they are 
// set appropriately, and the framebuffer set up to do 
// extra read operation 

// Disable the Color DDA unit, we do not want to 
// associate a color with this fragment. 
colorDDAMode . Unit Enable = TVP4010_FALSE 
ColorDDAMode (colorDDAMode) 

// Define the region we wish to copy from. 

StartXDom (200<<16) 

StartXSub (300<<16) 

dXSub (0) 

dXDom (0) 

StartY (0) 

dY (1«16) 

Count (100) 

render . PrimitiveType = TVP4010_TRAPEZOID 
Render (render) // Start the rasterization 
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4.11 Color DDA Unit 

The Color DDA unit is used to associate a color with a fragment produced by 
the Rasterizer. This unit should be enabled for rendering operations and 
disabled for pixel rectangle operations (i.e., copies, uploads and downloads). 

4.11.1 RGBA and Color-lndex(CI) Modes 

Two color modes are supported by the TVP401 0, true color RGBA and color 
index (CI). 

The TVP401 internal color representation is RGBA with 8 bits per component 
as shown in Figure 4-36: 

Figure 4-36. TVP4010 Color Representation 
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This format is the same for all the different framebuffer configurations 
supported. If the number of bits in the framebuffer for a color component is less 
than eight, then the color value is left shifted into the most significant bits of that 
components field. The unused least significant bits should be set to zero. 

In CI mode, the color index is placed in the lower byte of the 32-bit register (i.e., 
the red component). 

4.11.2 Gouraud Shading 

When in Gouraud shading mode, the Color DDA unit performs linear 
interpolation given a set of start and increment values. Clamping is used to 
ensure that the interpolated value does not underflow or overflow the 
permitted color range. 

For a Gouraud shaded trapezoid, the TVP401 interpolates from the dominant 
edge of a trapezoid to the subordinate edges. This means that two increment 
values are required per color component, one to move along the dominant 
edge and one to move across the span to the subordinate edge. This is 
illustrated in the diagram of Figure 4-37, where C represents a color 
component (red, green, blue or color index). Alpha is not interpolated and 
stays at its initial value. 
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Figure 4-37. Color Interpolation 




CStart = the initial color value 

dCdyDom = color gradient in the Y direction along the 
dominant edge 

dCdx = color gradient in the X direction 

See section 4.2, for the delta values for a Gouraud Shaded Triangle. 

For Gouraud shaded lines, each line is treated as the dominant edge of a 
trapezoid, and so no dCdx increment is required. 

To allow accurate interpolation, the increment values are specified in a 1 7-bit 
fixed point format. The format is 2s complement with 1 -bit sign, 5 bits integer 
and 11 bits fraction as shown in Figure 4-38: 

Figure 4-38. Fixed Point Color Format 
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Note that if you are rendering to multiple buffers and have initialized the start 
and increment values in the Color DDA unit, then any subsequent Render 
command will cause the start values to be reloaded. 

If subpixel correction has been enabled for a primitive, then any correction 
required will be applied to the color components. 

4.11.3 Flat Shading 

In flat shading mode, a constant color is associated with each fragment. This 
color is loaded into the ConstantColor register which has the format shown 
previously in Figure 4-36. 
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4.11.4 Registers 

The main control register for the Color DDA unit is the ColorDDAMode register, 
see Figure 4-39. 

Figure 4-39. ColorDDAMode Register 
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Shading Mode Unit Enable 

The registers to set up Gouraud shading in the Color DDA unit are as shown 
in Table 4-16. 

Table 4-16. Color Interpolation Registers 



Register 


Data Field 




Description 


RStart 


17 bit 2s comp 


fix pt 


Red start value 


dRdx 


17 bit 2s comp 


fix pt 


Red derivative per unit X 


dRdyDom 


17 bit 2s comp 


fix pt 


Red derivative per unit Y, dominant edge 


GStart 


17 bit 2s comp 


fix pt 


Green start value 


dGdx 


17 bit 2s comp 


fix pt 


Green derivative per unit X 


dGdyDom 


17 bit 2s comp 


fix pt 


Green derivative per unit Y, dominant edge 


BStart 


17 bit 2s comp 


fix pt 


Blue start value 


dBdx 


17 bit 2s comp 


fix pt 


Blue derivative per unit X 


dBdyDom 


17 bit 2s comp 


fix pt 


Blue derivative per unit Y, dominant edge 


AStart 


17 bit 2s comp 


fix pt 


Alpha start value 



4.11.5 Flat Shading Example 

The pseudocode for a flat shaded primitive example follows: 

// Set DDA to flat shade mode 
ColorDDAMode .UnitEnable = TVP4 010_ENABLE 
ColorDDAMode . Shade = TVP 4 01 0_FLAT_SHADE_MODE 
ColorDDAMode ( ColorDDAMode ) 

ConstantColor (OxFFFFFFFF) // Load the flat color 
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4.11.6 Gouraud Shaded Trapezoid Example 

See subsection 4.2.3 for details of how to calculate delta values. 

/ / Enable unit in Gouraud shading mode 
colorDDAMode . Unit Enable = TVP4 010_ENABLE 
colorDDAMode . Shade = TVP4 010_GOURAUD_SHADE_MODE 
ColorDDAMode (colorDDAMode) 

// Load the color start values and deltas for dominant 
// edge and the body of the trapezoid 
RStartO // Set-up the red component start value 

/ / Set-up the red component increments 



dRdx ( ) 
dRdyDom ( ) 
GStart () 
dGdx ( ) 
dGdyDom ( ) 
BStart () 
dBdx ( ) 
dBdyDom ( ) 



// Set-up the green component start value 

/ / Set-up the green component increments 

// Set-up the blue component start value 

// Set-up the blue component increments 



4.11.7 Gouraud Shaded Line Example 

See section 4.2.3 for details of how to calculate delta values. 

/ / Set DDA for Gouraud shaded mode 
colorDDAMode .UnitEnable = TVP4 010_ENABLE 
colorDDAMode . Shade = TVP4 010_GOURAUD_SHADE_MODE 
ColorDDAMode (colorDDAMode) 

// For lines we need only start values and dominant 
// edge deltas 

RStartO // Set-up the red component start value 

dRdyDom ( ) // Set-up the red component increment 
GStart () // Set-up the green component start value 

dGdyDom ( ) // Set-up the green component increment 
BStart () // Set-up the blue component start value 

dBdyDom ( ) // Set-up the blue component increment 
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4.12 Texture/Fog/Blend 

The Texture/Fog/Blend unit applies effects to the interpolated color. The 
effects are applied in the order: texture then fog then blend. 



4.12.1 Texture Application 

There are two major types of texture application, one suitable for RGB 
applications and one suitable for Ramp applications; Ramp applications use 
RGB textures and framebuffer format but are limited to a white-light source. 
The enable bit in the TextureColorMode register and the TextureEnable bit in 
the Render register must both be enabled before texture is applied. 



4.12.2 RGB Texture Application 

The RGB texture application is referred to elsewhere as the OpenGL type of 
texture application. It can be done in one of three ways. 

In copy mode, the texture color replaces the current fragment color. 

In decal mode the texture color is blended with the fragment color using the 
texture alpha value: 

C f = C t A t + Cf(l-At) 
A f = A f 

where: C f is the fragment color, C t is the texture color , A f fragment alpha and 
A t is the texture alpha. If the texture alpha value is one, decal becomes the 
same as copy. 

In modulate mode the color components are multiplied together: 

c f = c t c f 

A f = A t A f 

where: C f is the fragment color, C t is the texture color, A f fragment alpha and 
A t is the texture alpha. 

4.12.2.1 Ramp Texture Application 

The texture application is referred to elsewhere as the Apple type of texture 
application because of the approach adopted by QuickDraw3D. This type of 
texture application is done three stages, where each stage can be 
independently enabled or disabled. The first stage is decal, which does the 
operation: 

C f = C t A t + Cf(l-At) 
A f = A f 
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If decal is not enabled then the following operation is done: 

c f = c t 

A f = A t A f 

The next operation is modulate, which does: 

C f = K d C D 
A f = K d A D 

where: C f is the fragment color, K d is an interpolated parameter which 
represents the diffuse light intensity, A t is the texture alpha, C D is the color after 
the decal operation and A D is the alpha value after the decal operation. 

The next operation is highlight: 

Cf = C M +Kg 

A f = A M +K S 

where: C f is the fragment color, K s is an interpolated parameter which 
represents the specular or highlight intensity, A t is the texture alpha, C M is the 
color after the modulate operation and A M is the alpha value after the modulate 
operation. 

4.12.3 Fog Application 

The fog unit is used to combine the incoming fragment color (generated by the 
Color DDA unit, and potentially modified by the texture unit) with a pre-defined 
fog color. Fogging can be used to simulate atmospheric fogging and used also 
to depth-cue images. 

Fog application has two stages; derivation of the fog index for a fragment, and 
application of the fogging effect. The fog index is a value which is interpolated 
over the primitive using a DDA in the same way color and depth are 
interpolated. The fogging effect is applied to each fragment using the equation 
described below. 

Note that although the fog values are linearly interpolated over a primitive, the 
fog values at each vertex can be calculated on the host using a linear fog 
function (typically for simple fog effects and depth-cueing) or a more complex 
function to model atmospheric attenuation. This might be an exponential 
function. 

A fog test is supported that will reject a fragment if its fog value is negative. This 
may be used if the background of the scene has been cleared to the fog color; 
any pixels that are far enough from the eye to be completely fogged need not 
be plotted. 

The enable bit in the FogMode register and the FogEnable bit in the Render 
register must both be enabled before fog will be applied. 
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4.12.4 Fog Index Calculation - The Fog DDA 

The fog DDA is used to interpolate the fog index (F) across a primitive. For a 
fogged trapezoid, the TVP4010 interpolates from the dominant edge of a 
trapezoid to the subordinate edges. This means that two increment values are 
required, one to move along the dominant edge and one to move across the 
span to the subordinate edge. This is illustrated in the diagram below. The 
rendering direction chosen here is bottom to top. 

FStart = Start fog value 

dFdyDom = Increment along dominant edge. 

dFdx = Increment along the scan line. 

The dFdx value is not required for fogged lines. 

The mechanics are similar to those of the other DDA units, as shown in Figure 
4-40: 

Figure 4-40. Fog Interpolation Over A Triangle 




where: 

FStart = initial fog value. 

dFdx = Fog gradient in the X direction. 

dFdyDom = Fog gradient along the dominant edge of a 
primitive . 

Note that for fogged lines the dFdx delta is not required. 

The fog index is specified as an 18-bit fixed point value. The format is 2s 
complement with 2 bits integer and 1 6 bits fraction, see Figure 4-41 . 
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Figure 4-4 1. Fog Interpolant Fixed Point Format 
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The fog DDA calculates a fog index value which is clamped to lie in the range 
0.0 to 1 .0 before it is used in the following fogging equations. 

4.12.5 Fogging Equation 

The fogging equation is: 

C = fCi + (l-f)Cf 

where: 

C = outgoing fragment color 
Cf = fog color 

Ci = incoming fragment color 
f = fog index 

The equation is applied to the color components, red, green and blue; alpha 
is not modified. Figure 4-42 shows how the fogging would typically affect a 
scene. Initially no fogging occurs, f >=1 .0, then a region of linear combination 
of the fragment color and fog color occurs 1 .0 < f > 0.0, followed by a region 
of constant fog color, f <= 0.0. 
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Figure 4-42. Fogging 
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4.12.6 Alpha Blending 

Two types of alpha blending are supported, one that is common to RGB and 
Ramp applications, and one that is specific to Ramp applications. Alpha 
blending combines the fragment color, potentially after texture and fog have 
been applied, with that stored in the framebuffer. 

Data from the framebuffer is in the raw format, so it must be converted to the 
internal format before the blend can be done. This is achieved by setting the 
ColorFormat and ColorFormat Extension fields in the AlphaBlendMode 
register. 

In some situations blending is desired when no retained alpha buffer is pres- 
ent. In this case, the alpha value which is considered to be read from the frame- 
buffer will be set to 1 .0. The NoAlphaBufferbit in the AlphaBlendMode register 
controls this. 

4. 12.6. 1 Common Blend Mode 

The common blend operation is defined as: 

C = C S A S + C d (l-A s ) 

where: C is the output color, C s is the source color , A s is the source alpha 
and C d is the destination color read from the framebuffer. Setting the Operation 
field to Blend in the AlphaBlendMode register achieves this. 
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See The OpenGL Reference Manual and The OpenGL Programming Guide 
from Addison-Wesley for more details on this style of alpha blending. 

4.12.7 Ramp Blend Mode 

The alternative blend mode is called PreMult and does the operation: 

C = C s + C d (l-A s ) 

4.12.8 Image Formatting 

The Alpha Blend and Color Format units can be used to format image data into 
any of the supported the TVP401 framebuffer formats. 

Consider the case where the framebuffer is in RGBA 5.5.5.1 mode, and an 
area of the screen is to be uploaded and stored in an 8-bit RGB 3:3:2 format. 
The sequence of operations is: 

□ Set the Rasterizer as appropriate ( see subsection 4.3.9) 

□ Enable framebuffer reads 

□ Disable framebuffer writes and set the UpLoadData bit in the FBWrite- 
Mode register 

□ Enable the Alpha Blend unit, set the operation to Format (assuming no 
alpha blending is needed) and set the color mode to RGBA 5.5.5.1 . This 
can all be achieved by setting the appropriate fields in the AlphaBlend- 
Mode register. 

□ Set the Color Format unit to format the color of incoming fragments to an 
8-bit RGB 3:3:2 framebuffer format. 

The upload now proceeds as normal. This technique can be used to upload 
data in any supported format. 

The same technique can be used to download data which is in any supported 
framebuffer format. In this case the Rasterizer is set to synchronize with 
FBData (rather than Color), framebuffer writes are enabled and the 
UpLoadData bit cleared. 

4.12.9 Registers 

The TextureColorMode register (see Figure 4-43) is used to enable and 
disable texturing (qualified by the texture application bit in the Render 
command register). The KsDDA and KdDDA bits enable the internal DDAs 
and should be set for modulate or highlight Ramp texture application modes. 
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The Texture Type field differentiates between Ramp and RGB application 
modes. Combinations of decal, modulate and highlight are supported with 
Ramp Application Mode. 



Figure 4-43. TextureColorMode Register 
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The TexelO register holds the texture value. This may be loaded automatically 
by the Texture Read unit, or supplied from the host for a procedural texture. 
Figure 4-44 and Figure 4-45 show texture values in RGB and YUV formats 
respectively. This register is also used to hold the background color for the bit- 
mask and stipple tests. If the tests fail then this color can be used in place of 
that from the Color DDA unit. 



Figure 4-44. TexelO Register - RGB format 
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Figure 4-45. TexelO Register - 
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The six registers: KsStart, dKsdx, dKsdyDom, KdStart, dKddx and dKddyDom 
hold the start, dx and dyDom parameters for Ks and Kd. The format is 2s 
complement 2.1 6 fixed point format (1 bit sign, 1 bit integer, 1 6 bits fraction) 
with an effective range of ±1 .999. The values of Ks and Kd at each vertex are 
used to calculate the gradient values in much the same way as the Z gradients, 
when interpolating depth (see subsection 4.2.5). 

The FogMode register (see Figure 4-46) is used to enable and disable fogging 
(qualified by the fog application bit in the Render command register). Setting 
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Fog Test causes fragments with negative fog values to be rejected as 
described in subsection 4.12.2. 



Figure 4-46. FogMode Register 
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Additional fog registers are, FogColor, which holds the fog color in the standard 
color format. FStart, dFdx & dFdyDom which control the fog DDA and are 
formatted in 2s complement 2.16 fixed-point format as described previously. 

Blending is controlled by the AlphaBlendMode register as shown in Figure 
4-47. 



Figure 4-47. AlphaBlendMode Register 
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The color format and order is needed as the destination color is read from the 
framebuffer and needs to be converted into the internal the TVP4010 
representation; it should therefore be set as appropriate for the framebuffer. 
The operation can be either format, blend, or PreMult. 

4.12.10 Texture Application Example 

The following pseudocode is an example of a texture mapped trapezoid: 

// Set-up Texture/Fog/Blend unit 
textureColorMode. Enable = TVP4010_TRUE 

textureColorMode . ApplicationMode = TVP4 010_TEXTURE_MOD- 
ULATE 



TextureColorMode (textureColorMode ) 
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1 1 Render with texture enabled in render command 
// render. TextureEnable = TVP4010_TRUE 



4.12.11 FogExample 



A Gouraud shaded, fogged RGBA trapezoid, with the fog color set to white. 
See subsection 4.2.5 for details of how to calculate depth delta values - fog 
values are calculated in a similar way. 

/ / Enable the Color DDA unit in Gouraud shading mode 

colorDDAMode . Unit Enable = TVP4 010_ENABLE 

colorDDAMode . Shade = TVP4 010_GOURAUD_SHADE_MODE 

ColorDDAMode (colorDDAMode) 

/ / Enable the Fog unit 

fogMode.FogEnable = TVP4010_TRUE 

FogMode (fogMode) 

// Set the fog color to white 

FogColor (OxFFFFFFFF) 

// Load the color start values and deltas for dominant 
edge 

// and the body of the trapezoid 

RStartO // Set-up the red component start value 

dRdx ( ) // Set-up the red component increments 

dRdyDom ( ) 

GStartO // Set-up the green component start value 

dGdx ( ) // Set-up the green component increments 

dGdyDom ( ) 

BStartO // Set-up the blue component start value 

dBdx () // Set-up the blue component increments 

dBYDom ( ) 

// Load the start value and delta for dominant edge 
// and the body of the trapezoid 

// Note that the fog deltas are calculated in the same 
// way as the color deltas 

FStartO // Set-up the fog component start value 

dFdx ( ) // Set-up the fog component increments 

dFdyDom ( ) 

// When issuing a Render command the FogEnable bit 
// should be set in addition to the fog unit being 
// enabled: 

// render .FogEnable = TVP4 010_TRUE 
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4.13 Color Format Unit 

The Color Format unit converts from the TVP4010 internal color 
representation to a format suitable to be written into the framebuffer. This 
process may optionally include dithering of the color values. If the unit is 
disabled, the color is not modified in any way. 

4.13.1 Color Formats 

The framebuffer may be configured to be RGBA or Color Index (CI). Table 3.1 
shows the full list of color modes supported by the TVP401 0. The R, G, B and 
A columns show the width of each color component. The least significant bit 
position is 0. For the Front and Back Modes the value is repeated in both 
buffers, and writemasks may be used to update only one buffer. In CI mode, 
the index is repeated in all streams. 

4.13.2 Color Dithering 

The TVP401 uses an ordered dither algorithm to implement color dithering. 
It also has a line dither mode which uses a different algorithm which generally 
gives better results for lines because it is independent of orientation. This 
mode is not available for trapezoids. 

If the Color Format unit is disabled, the color components RGBA are not 
modified and is truncated when placed in the framebuffer. In CI mode, the 
value is truncated to the nearest integer. In both cases the result is clamped 
to a maximum value to prevent overflow. 

The TVP401 supports 8:8:8:8 RGBA format for 2d operations only. When this 
mode is selected and dithering is enabled, it results in 5.5.5.1 RGBA quality for 
each 32-bit pixel. This can be used when the window manager needs to be set 
up for true color at the same time as 3D windows are required. 

In some situations only screen coordinates are available, but window relative 
dithering is required. This can be resolved by setting up the optional X and Y 
offsets which get added to the coordinates before the dither tables are 
indexed. Each offset is a 2-bit number which is supplied for each coordinate. 
The XOffset and YOffset fields in the DitherMode register control this operation 
and should be set to zero if window relative coordinates are used. 

4.13.3 ForceAlpha 

The Color Format unit can force the alpha value to be either x or the 
maximum 0xF8, or leave it unchanged. This can be used to implement 
overlays. See section 6.6 for a detailed description. 
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4.13.4 Registers 



One register controls the operation of this unit, DitherMode, and its layout is 
shown in Figure 4-48: 



Figure 4-48. DitherMode Register 

31 24 16 



— i — i — i — i — i — r 
Reserved 

I I I I I L 



ColorFormatExtension 

Reserved 




ForceAlpha 

DitherMethod 
ColorOrder 



X offset 
Y offset 



Dither enablex 

Unit Enable 



Color format 



The X and Y offset fields are for window relative dithering. Color order specifies 
RGB or BGR color order. The Color format and Color format extension fields 
control color depth and options are given in Table 4-1 . 



4.13.5 Dither Example 

To set the framebuffer format to RGB 3:3:2 and enable dithering: 

// 332 Dithering 

ditherMode.UnitEnable = TVP4010_TRUE 
ditherMode.DitherEnable = TVP4010_TRUE 
ditherMode . ColorMode = TVP4 010_COLOR_FORMAT_RGB_332 
DitherMode (ditherMode) // Load register 

4.13.6 3:3:2 Color Format Example 

To set the framebuffer format to RGB 3:3:2 and disable dithering: 

// 332 No Dither 

ditherMode.UnitEnable = TVP4010_TRUE 
ditherMode.DitherEnable = TVP4010_FALSE 
ditherMode . ColorMode = TVP4 010_COLOR_FORMAT_RGB_332 
DitherMode ( ditherMode ) // Load register 

4.13.7 8:8:8:8 Color Format Example 

To set the framebuffer to RGBA 8:8:8:8 and not dithered: 
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II 8888 Dithered (No effect as 8 bit components are 
// not dithered) 

ditherMode .UnitEnable = TVP4010_TRUE 
ditherMode.DitherEnable = TVP4 010_FALSE 
ditherMode. ColorMode = TVP4010_COLOR_FORMAT_RGBA_8888 
DitherMode (ditherMode) // Load register 
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4.14 Logical Op Unit 

The Logical Op unit performs three functions: 

■ optional control of a special the TVP4010 mode that allows high- 
performance flat-shaded rendering. 

■ logic operations between the fragment color (source color) and a 
value from the framebuffer (destination color) 

■ software writemasking 

4.14.1 High Speed Flat Shaded Rendering 

A special the TVP4010 rendering mode is available that allows high speed 
rendering of unshaded images. Note this method is not as fast as block fills but 
is less restrictive. To use the mode, the following constraints must be satisfied: 

■ Flat shaded primitive 

■ No dithering required 

■ No logical ops 

■ No stencil or depth testing required 

■ No alpha blending 

The following are available: 

■ Bit masking in the Rasterizer 

■ Area and line stippling 

■ User and Screen Scissor test 

If all the conditions are met, then high speed rendering can be achieved by 
setting the FBWriteData register to hold the framebuffer data (in raw 
framebuffer format) and setting the UseConstantFBWriteData bit in the 
LogicalOpMode register. All unused units should be disabled. 

This mode is most useful for 2D applications or for clearing the framebuffer 
when the memory does not support block writes. Note that FBWriteData 
register should be considered volatile when context switching. 
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4.14.2 Logical Operations 

The logical operations supported by the TVP4010 are listed in Table 4-17. 

Table 4- 1 7. Logical Operations 



Mode 


Name 


Operation 





Clear 





1 


And 


S&D 


2 


And Reverse 


S&-D 


3 


Copy 


S 


4 


And Inverted 


~S& D 


5 


No-op 


D 


6 


Xor 


S A D 


7 


Or 


S | D 


8 


Nor 


~(S | D) 


9 


Equivalent 


~(S A D) 


10 


Invert 


~D 


11 


Or Reverse 


S | ~D 


12 


Copy Invert 


~S 


13 


Or Invert 


~S | D 


14 


Nand 


~(S & D) 


15 


Set 


1 



Where: S = Source (fragment) Color, D = Destination (framebuffer) Color 

For correct operation of this unit in a mode that takes the destination color, 
configure the TVP4010 to allow reads from the framebuffer using the 
FBReadMode register. See section 4.10 for more details. 

The TVP4010 makes no distinction between RGBA and CI modes when 
performing logical operations. However, logical operations are generally only 
used in CI mode. 

4.14.3 Software Writemasks 

Software writemasking is normally implemented only when Hardware 
writemasking is unavailable. It is controlled by the FBSoftwareWriteMask 
register. The data field has one bit per framebuffer bit that when set, allows the 
corresponding framebuffer bit to be updated. When reset, it protects the bit 
from being written. Software writemasking is applied to all fragments and is not 
controlled by an enable/disable bit. However, it may effectively be disabled by 
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setting the mask to all ones. If the mask is not all ones, the ReadDestination 
bit must be enabled in the FBReadMode register to correctly use software 
writemasks. See the Framebuffer Read/Write section for details of how to 
enable/disable framebuffer reads. 

The software writemask MUST be set to all ones, except when software write- 
masking is explicitly required. 

4.14.4 Registers 

The operation of the unit is controlled by the LogicalOpMode register as shown 
in Figure 4-49. 

Figure 4-49. LogicalOpMode Register 



31 


24 


16 


8 







i i i i i i i i i i i i i i i i i i i i i i i i i 

Reserved 

i i ii 




i i i 
LogicOp 

i i i 





UseConstantFBWriteData 

Logical Openable 

4.14.5 XOR Example 

To set the logical operation to XOR: 

// Set framebuffer to allow reads 
/ / Not shown 

logicalOpMode.UnitEnable = TVP4 010_ENABLE 
logicalOpMode . LogicalOp = TVP4 010_LOGICOP_XOR 
LogicalOpMode (logicalOpMode) // Load register 

4.14.6 Software Writemask Example 

To set the logical operation to COPY, enable the software writemask, and write 
to the green component in an-8bit framebuffer configured in 3:3:2 RGB mode: 

// Set framebuffer to allow reads 
/ / Not shown 

ditherMode .UnitEnable = TVP4 010_ENABLE 
ditherMode.DitherEnable = TVP4 010_ENABLE 
ditherMode . ColorMode = TVP4 010_COLOR_FORMAT_RGB_332 
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DitherMode (ditherMode) // Load register 
logicalOpMode.UnitEnable = TVP4 010_ENABLE 
logicalOpMode . LogicalOp = TVP4010_LOGICOP_COPY 
LogicalOpMode (logicalOpMode) // Load register 
FBSoftwareWriteMask (0xFFFFFFE3) 
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4.15 Host Out Unit 

The Host Out Unit controls which registers are available at the output FIFO, 
gathers statistics about rendering operations (picking and extent testing) and 
controls synchronization of the TVP4010 with the host. 

4.15.1 Filtering 

Filtering controls the data made available at the output FIFO. There are the 
following categories: 

□ Depth, Stencil, Color: These are data values associated with a fragment 
which has been read from the localbuffer or framebuffer, or generated 
using the UpLoadData flag in the Framebuffer Write Unit. This category 
is normally associated with uploading data to the host. 

□ Synchronization : A single register, Sync, which is used to synchronize the 
TVP4010 and flush the graphics pipeline. 

□ Statistics: The registers associated with extent checking and picking. 

The filtering is controlled by the FilterMode register which has 2-bit fields for 
each category. These fields select whether the register tag and/or register data 
are passed to the output FIFO. The format of the FilterMode register is shown 
in Table 4-18. 



4-88 



Host Out Unit 



Table 4-18. Filter Modes 


Register Category 


Tag Con- 
trol Bit 


Data 
Control 
Bit 


Description 


Reserved 





1 




Reserved 


2 


3 




Depth 


4 


5 


This is the data from image upload of the Depth (Z) 
buffer. 


Stencil 


6 


7 


This is the data from image upload of the Stencil buffer. 


Color 


8 


9 


This is the data from image upload of the Framebuffer 
(FBColor). 


Synchronization 


10 


11 




Statistics 


12 


13 


This is the data generated following a command to read 
back the results of the statistic measurements: 
PickResult, MaxHitRegion, MinHitRegion 


Reserved 


14 


15 





Note that the filter unit must be set appropriately before any synchronization 
can take place. 



4.15.2 Statistic Operations 

There are two statistic collection modes of operation; picking and extent 
checking. Picking is normally used to select drawn objects or regions of the 
screen. Typically, extent checking is used to determine the bounds within 
which drawing has occurred so that a smaller area of the framebuffer can 
subsequently be cleared. 

Statistic collection is controlled using the StatisticMode register. 

4.15.2.1 Picking 

In picking mode, the active and/or passive fragments have their associated XY 
coordinates compared against the coordinates specified in the MinRegion and 
MaxRegion registers. If the result is true, then the PickResult flag is set, other- 
wise it holds its previous state. The compare function can be either Inside or 
Outside. Before picking can start, the ResetPickResult register must be 
loaded to clear the PickResult flag. 

The MinRegion and MaxRegion registers are loaded to select the region of 
interest for picking. A coordinate is inside the region if: 

X min - X < X max 

Y ■ < Y < Y 
1 min 1 max 
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where X and Y are from the fragment and the min/max values are from 
MinRegion and MaxRegion registers. This comparison is identical to the one 
used in the scissor tests. 

The following stages are required for picking: 

1) load ResetPickResult, MinRegion and MaxRegion registers 

2) Set up the FilterMode to allow statistic commands out of the TVP401 

3) Draw the primitives. 

4) Send a PickResult command. 

5) Poll the output FIFO waiting for the PickResult to have passed through the 
TVP4010. 

Block fills are ignored by the picking operation. 

4. 15.2.2 Extent Checking 

In extent mode, active and/or passive fragments have their associated XY 
coordinates compared to the MinRegion and MaxRegion registers and if found 
to be outside the defined rectangular region, then the appropriate register is 
updated with the new coordinate(s) to extend the region. The Inside/Outside 
bit has no effect in this mode. Block fills are included in the extent checking if 
the StatisticMode register is set to include spans. 

The MinRegion and MaxRegion registers are loaded to select the maximum 
value (MinRegion) and minimum value (MaxRegion) for extent checking. A 
coordinate is inside the region if: 

X min - X < X max 
Y ■ < Y < Y 

where X and Y are from the fragment and the min/max values are from 
MinRegion and MaxRegion registers. This comparison is identical to the one 
used in the scissor tests. 

Once all the necessary primitives have been rendered, the results can be 
found using the MinHitRegion and MaxHitRegion commands, that cause the 
contents of the MinRegion and MaxRegion registers respectively to be written 
into the output FIFO (under control of the FilterMode register). 

4.15.3 Synchronization 

The Sync command register provides a means of ensuring that the TVP401 
has completed all outstanding actions such as localbuffer and framebuffer 
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accesses. Sync is filtered and written to the output FIFO in a manner similar 
to that for the other registers. The host can either poll for Syncs by reading the 
output FIFO or await a Sync interrupt. 

If generation of an interrupt is required, then the most significant bit of the Sync 
command register must be set, and the filtering must be set up to at least allow 
the Sync to be written into the FIFO. If the FilterMode is set up so the Sync is 
not written to the FIFO, then Sync interrupts will not be generated. The actual 
interrupt will not occur until the Sync data or tag has passed through the 
TVP4010 and is on the output of the FIFO. This to allow low level 
resynchronization between the graphics core and PCI clock domains. The 
FIFO has an extra bit in width to accommodate the interrupt signal. When both 
the data and tag are written into the FIFO, only the first entry in the FIFO will 
cause the interrupt (assuming an interrupt was requested). 

The remaining bits in the Sync data field are free and can be used by the host 
to identify the reason for the Sync. 



4.15.4 Registers 

Filtering is controlled by the FilterMode register as shown Figure 4-50. 
Figure 4-50. FilterMode Register 

31 24 16 8 



t — i — i — i — i — i — i — i — i — i — i — i — i — i — r 
Reserved 



~i — i — i — i — i — i — i — i — i — i — i — i — i — r~ 
Individual bits defined above 

j i i i i i i i i i i i i i_ 



Statistic collection is controlled by the StatisticMode register as shown Figure 
4-51. 



Figure 4-51. StatisticMode Register 

31 24 



16 



~~i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — r 
Reserved 

J I I I I I I I I I I I I I I I L 



Include Span: 
Compare Function 
Monitor Culled Fragments 

Monitor Pixels Written 




Statistics Type 
Enable Statistics 



The Include-Spans bit allows control over whether or not block fills are 
included in the returned information. 
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ResetPickResult is used to clear the pick flag, see Figure 4-52. The data field 
for this register is unused. 

Figure 4-52. PickFtesult Register 

31 24 16 8 

— i — i — i — i — i — i — i — I — i — i — i — i — i — i — i — I — i — i — i — i — i — i — i — I — i — i — i — i — i — i — i — 

Reserved 

i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i I 

Pick Flag 

MinRegion, MaxRegion registers are used to load picking/extent regions, and 
MaxHitRegion and MinHitRegion are used to read the registers back. The 
format is 1 6-bit 2s complement numbers with Y in the most significant part and 
X in the least significant part of the word. 

Setting the most significant bit of the Sync register will request a Sync interrupt. 
Bits 0-30 are available for the user. 

4.15.5 Filter Mode Example 

The following pseudocode implements an example filter mode: 

// Set up Filter mode to only permit read back of 
// synchronization tag and data 
FilterMode (OxOCOO) // Set bits 10 & 11 

4.15.6 Picking Example 

Set the statistic mode to picking and detect any active fragments in the region 
0x0 <= x < 0x1 00, 0x0 <= y < 0x1 00. Render some primitives, then read back 
the results. 

// Set filter mode as above 
FilterMode (OxOCOO) // Set bits 10 & 11 
// Set statistic mode 
MinRegion ( ) 

MaxRegion (0x100 | 0x100 << 16) 
// Clear the picking flag 

ResetPickResult ( 0x0 ) // Data not used 
// Now render primitives 
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Render (render) // All units set as appropriate 

// All rendering finished. 

// Set the filter mode to allow read back of Syncs and 

// statistic information (tag and data) 

FilterMode (0x3C00) // Set bits 10 to 13 

// Write to the PickResult register 

PickResult (0x0) // Data not used 

// Now read the PickResult from the output FIFO (not 
shown) 

4.15.7 Sync Interrupt Example 

Generate a synchronization interrupt and encode some user defined data 
(0x34) in the lower 31 bits of the Sync register. 

// Set up Filter mode to only permit read back of 
// synchronization tag and data 
FilterMode (OxOCOO) // Set bits 10 & 11 

// Write to the Sync register with the top bit (bit 31) 
set and 

// user data encoded into the lower bits (0-30) 
sync = (0x1 << 31) | (0x34 & 0x7FFFFFFF) 
Sync ( sync) 

// Now wait for the sync interrupt. Not shown. 
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Initialization 



This chapter outlines the requirements for initializing the TVP4010. 

Topic Page 

5.1 Initializing the TVP4010 5-2 

5.2 System Initialization 5-3 

5.3 Window Initialization 5-7 

5.4 Application Initialization 5-10 

5.5 Bypass Initialization 5-11 
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5.1 Initializing the TVP4010 

This section describes how to initialize the TVP401 following reset and prior 
to carrying out rendering operations. 

Initialization falls broadly into three areas, though in different systems precise 
responsibilities can vary: 

□ System initialization covers the setting up of the PCI bus, memory and 
video output. This information typically is only initialized once following 
reset. 

□ Window initialization, also referred to as context initialization, covers the 
setting of the base address of the current rendering window and its color 
format. This must occur at reset, but will need updating each time the 
TVP401 starts drawing to a new window. 

□ Application initialization covers those states that are typically dynamic; 
enabling and disabling depth testing are examples. Again the application 
state must be set at reset, but is likely to be updated relatively frequently. 

To make use of the full functionality of the TVP4010, consult the relevant 
sections in chapter 4 Graphics Programming. Examples are given which make 
use of the pseudocode conventions given in Appendix B. 

Note that in general the graphics registers (those listed in Appendix A, as 
opposed to those documented in the Data Manual) are not hardware initialized 
to specific values at reset. In the examples that follow, it is assumed that the 
data structures used to load these registers are initialized to zero. Thus, bit 
fields that are not set explicitly, will default to zero. 
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5.2 System Initialization 

5.2.1 PCI 

There are a set of PCI related registers which can be interrogated for 
information about the chip, for example its revision and device ID. Some of 
these PCI related registers will need to be set up at reset, for instance to 
configure the base addresses of the different memory regions of the chip. For 
more details refer to the TVP4010 Data Manual and the PCI Local Bus 
Specification, Revision 2.1 . 

5.2.2 Memory Configuration 

The memory interface control registers should be programmed to reflect the 
type and amount of memory fitted. The registers are specified in the TVP401 
Hardware Reference Manual. 

5.2.3 SVGA and Internal Video Timing Registers 

Details for programming the SVGA registers can be found in the TVP4010 
Data Manual. 

The core video timing generator should be programmed to reflect the timings 
of the monitor being used and the screen resolution and color depth. Note that 
there is also a SVGA video timing register(VTR) and care must be taken to 
ensure the correct one is enabled at the right time. To change from SVGA to 
core display mode, two stages are required. Firstly, the core VTR must be set 
up and then VGAControlReg must be loaded (the EnableVGADisplay bit set 
to 0). 

Details of programming the registers for both VTRs can be found in the 
TVP4010 Data Manual. 

5.2.4 Screen Width 

The width of the screen is initialized by setting the three partial products fields 
in the FBReadMode, LBReadMode and TextureMapFormat registers. Note 
that the width is in pixels, not in bytes, so the same values apply regardless 
of framebuffer depth, for a given screen resolution. A full list is given in 
Appendix C. 

To initialize the screen to be 1024 pixels wide, set the registers as follows. 

fbReadMode.PPO = 5 
fbReadMode.PPl = 5 
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fbReadMode.PP2 = 4 
FBReadMode (fbReadMode) 
IbReadMode.PPO = 5 
IbReadMode.PPI = 5 
IbReadMode . PP2 = 4 
LBReadMode (IbReadMode) 
textureMapFormat .PPO = 5 
textureMapFormat . PP1 = 5 
textureMapFormat .PP2 = 4 
TextureMapFormat (textureMapFormat ) 

Note that the TVP401 supports a maximum screen resolution of 1 536 in width 
and 1024 in height. 

5.2.5 Screen Clipping Region 

The TVP401 supports a screen scissor clip which should be set at system 
initialization, and a user scissor clip which should initially be disabled. 
Assuming that the FBWindowBase and LBWindowBase registers are set 
appropriately, then setting the screen clip prevents writing outside the 
framebuffer memory (and localbuffer), which could have undesirable results. 
The following example would be appropriate for a resolution of 1024 x 768 
pixels: 

screenSize.X = 1024 
screenSize.Y = 768 
ScreenSize (ScreenSize) 

scissorMode . ScreenScissorEnable = TVP4 010_ENABLE 
scissorMode . UserScissorEnable = TVP4 010_DI SABLE 

ScissorMode (ScissorMode) 

5.2.6 Localbuffer and Framebuffer Configuration 

Since the TVP4010 supports a unified memory architecture, it must be 
decided how the memory is to be partitioned between framebuffer, localbuffer 
and texture memory. A typical configuration might be to allocate two screen 
sized buffers: one for the visible screen, the other for the 3D back buffer. Then 
allocate a localbuffer: this is always 1 6 bits per pixel; and allow the remainder 
to be used for texture memory. The localbuffer and texture memory can be 
considered to have different shapes to the front and back buffers. For example, 
suppose that a screen resolution of 800 x 600 at 8 bits per pixel is required, 
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then the following offsets could be used. Each offset is a count in pixels from 
the start of memory. 

Front buffer: pixel offset 

Back buffer: pixel offset 480000 (= 600*800 bytes) 

Local buffer: pixel offset 480000 (offset in 16 bit pix- 
els) 

Texture memory: byte offset 1920000 (= 2*600*800 + 
600*800*sizeof (USHORT) ) 

The size of the pixel depends on the buffer being considered. Hence the offset 
to the back buffer and the localbuffer appear to be the same but one is 
measured in bytes, the other in shorts. 

These offsets should be saved as software copies to used as required. For 
example, to select the front buffer for rendering, the FBPixelOffset register 
would be set to 0; to select the back buffer it would be set to the Back buffer 
pixel offset. The localbuffer offset should be added to the window base offset 
whenever the LBWindowBase register is updated. The value loaded into the 
TextureBaseAddress is a count of the number of texels from the start of 
memory. Thus the byte offset should be modified to be a texel count when 
used. In practice, some sort of texture allocation scheme will be needed where 
textures are allocated starting at the texture memory offset. The final value 
loaded into the TextureBaseAddress register will be the texture memory offset 
+ offset to the required texture with the final value converted to a texel count 
from the start of memory. 

The TVP4010 supports a range of localbuffer configurations. During 
initialization, fields in the LBWriteFormat and LBReadFormat registers should 
be set to appropriate values. For example: 

IbReadFormat . DepthWidth =3 // 15 bit depth buffer 

IbReadFormat . StencilWidth =3 // 1 bit stencil 
LBReadFormat (IbReadFormat) 

lbWriteFormat .DepthWidth =3 // 15 bit depth buffer 

lbWriteFormat . StencilWidth =3 1/1 bit stencil 
LBWriteMode (lbWriteFormat) 

Note it is possible to dynamically change the number of bits allocated to the 
depth and stencil buffers, for instance on a per window basis. 

Set the framebuffer and localbuffer read units to their default data sources: 

fbReadMode.DataType = TVP4 010_FBDATA 
FBReadMode (fbReadMode) 
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IbReadMode.DataType = TVP 4 01 0_LBDEFAULT 
LBReadMode (IbReadMode) 

The following registers are typically only needed for certain specialized 
operations. Normally their offsets will be zero. 

FBSourceOf f set (0) 
FBPixelOf f set (0) 
LBSourceOf fset (0) 



5.2.7 Host Out Unit 



Under some circumstances it is necessary to synchronize with the TVP401 0. 
This is controlled through the Sync command. The host out FIFO should 
normally be initialized so as to output the Sync tag and data (they can be 
filtered out). 

In addition the host out unit should normally be set to filter out all other output 
data, otherwise the host software must regularly poll the output FIFO to keep 
it drained and prevent it freezing the pipeline. For example: 

filterMode . Depth = TVP4010_NULL 

filterMode . Stencil = TVP4010_NULL 

filterMode . Color = TVP4010_NULL 

FilterMode . Synchronization = TVP 4 01 0_F I LTER_TAG_AND_DATA 

/ / Allow Syncs through 

filterMode . Statistics = TVP4 010_NULL 
FilterMode (filterMode) 



5.2.8 Disabling Specialized Modes 



Some operations should be disabled until they are need. See Chapter 4, 
Graphics Programming tor more details on their use. 

window. LBUpdateSource = TVP4010_TRUE 
window. ForceLBUpdate = TVP4 010_FALSE 
window. DisableLBUpdate = TVP4010_TRUE 
Window (window) 
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5.3 Window Initialization 

The TVP401 supports the concept of a window origin, and makes it relatively 
simple to implement systems which allow different color formats to coexist in 
different windows. 

5.3.1 Color Format 

The Color Format unit and the alpha blend unit should be initialized to an 
appropriate color format at reset. The units support a variety of different 
formats, listed in Table 3-1 of Chapter 3. 

For example to render in 3:3:2, 8-bit color format, the following would be 
needed: 

ditherMode.ColorFormat = TVP4 010_COLOR_FOR- 
MAT_RGB_332_FRONT 

DitherMode (ditherMode) 

alphaBlendMode.ColorFormat = TVP4 010_COLOR_FOR- 
MAT_RGB_332_FRONT 

AlphaBlendMode (alphaBlendMode) 

To enable dithering use the following: 

ditherMode. XOff set = 

ditherMode. YOffset = 

ditherMode. DitherEnable = TVP4 010_ENABLE 

ditherMode .UnitEnable = TVP4 010_ENABLE 

DitherMode (ditherMode) 

Note that the Color Format unit is normally always enabled even if dithering 
itself is not. This is because the unit handles color formatting as well as the 
dithering operation. 

5.3.2 Setting the Window Address and Origin. 

The TVP401 supports the concept of a current window origin. The origin of 
the window can be specified either as being in the Top Left or Bottom Left 
corner. This allows the user to pick the most appropriate coordinate system 
to use; for 3D graphics it is typically bottom left, whereas for window systems 
it is top left. Thus, for OpenGL set: 

fbReadMode . WindowOrigin = TVP4010_BOTTOM_LEFT_WINDOW_ORI- 
GIN 

FBReadMode (fbReadMode) 
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IbReadMode . WindowOrigin = TVP4010_BOTTOM_LEFT_WINDOW_ORI- 

GIN 

LBReadMode (IbReadMode) 

textureMapFormat .WindowOrigin = TVP4 010_BOTTOM_LEFT_WIN- 
DOW_ORIGIN 

TextureMapFormat (textureMapFormat ) 

The window origin is set in the Scissor unit. This information usually is provided 
by the window system. It will need updating if the window moves. As an 
example, if the position of the window is (200, 600) (using a bottom left 
coordinate system), the origin is specified as follows: 

WindowOrigin . X = 200 
WindowOrigin . Y = 600 
WindowOrigin (windowOr igin ) 

The base address of the window must also be established in the localbuffer 
read and framebuffer read units. The base address is the physical address that 
represents the base address of the window. Assuming the base address of the 
framebuffer represents the pixel in the top left corner of the screen, then for 
the example above the actual physical address of the bottom left pixel of the 
window will be set as follows: 

fbWindowBase = f bBaseAddress + 

(fbWidth * (fbHeight-1-600) + 200) 

FBWindowBase (fbWindowBase) 

lbWindowBase = lbBaseAddress + 

(lbWidth * (lbHeight-1-600) + 200) 

LBWindowBase (lbWindowBase) 

Where fbBaseAddress, fbWidth and fbHeight are the physical base address, 
width and height of the framebuffer (in pixels). fbBaseAddress and 
lbBaseAddress will have been precomputed as described previously in 
subsection 5.2.6. As with the WindowOrigin data, if the window moves, these 
registers must be updated. 

5.3.3 Writemasks 

Normally both the hardware (if present) and the software writemasks will 
initially be set to make all bitplanes writeable: 

FBSoftwareWriteMask (TVP4 010_ALL_WRITEMASKS_SET) 
FBHardwareWriteMask (TVP4 010_ALL_WRITEMASKS_SET) 

5.3.4 Enabling Writing 

Which buffers are enabled at any given time is window specific and should be 
considered for performance reasons. Performance will be improved if 
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unnecessary reads from, and writes to, buffers are disabled. For example, if 
the current rendering does not use depth or stencil testing, then reading and 
writing to the localbuffer may be disabled. The following example initializes the 
buffers to allow depth buffering and alpha blending: 

fbWriteMode.UnitEnable = TVP4 010_ENABLE 

FBWriteMode ( f bWriteMode ) 

lbWriteMode.UnitEnable = TVP4 010_ENABLE 

LBWriteMode (lbWriteMode) 

IbReadMode . ReadSourceEnable = TVP4 010_DISABLE 

IbReadMode.ReadDestinationEnable = TVP4 010_ENABLE 
LBReadMode (IbReadMode) 

fbReadMode. ReadSourceEnable = TVP4 010_DISABLE 

fbReadMode.ReadDestinationEnable = TVP4 010_ENABLE 
FBReadMode (fbReadMode) 

Note that to use software writemasking, the FBReadMode register ReadDes- 
tinationEnable field needs to besetifthewritemaskissettootherthan all ones. 



5.3.5 Setting Pixel Size 

The size of the pixels must be set so that the memory can be accessed 
correctly. To do this, use the FBReadPixel register as follows: 

fbReadPixel .PixelSize = TVP4010_16_BIT_PIXEL 
FBReadPixel (fbReadPixel) 

Three framebuffer pixel sizes are possible: 8, 1 6 and 32 bits. The localbuffer 
pixel size is fixed at 16 bits. 
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5.4 Application Initialization 



While an application is running, it may dynamically use features of the 
TVP4010 such as depth buffering, alpha blending, logical operations, etc. 
Initially, however, it is recommended that the respective units be disabled, to 
ensure that they are in a known state: 

areaStippleMode.UnitEnable = TVP4 010_DISABLE 
AreaStippleMode (areaStippleMode) 

depthMode.UnitEnable = TVP4 010_DISABLE 

DepthMode (depthMode) 

stencilMode.UnitEnable = TVP4 010_DISABLE 

StencilMode (stencilMode) 

textureAddressMode.UnitEnable = TVP4 010_DISABLE 
TextureAddressMode ( textureAddressMode ) 
textureReadMode.UnitEnable = TVP4 010_DISABLE 
TextureReadMode ( textureReadMode ) 

texelLUTMode .UnitEnable = TVP4 010_DISABLE 

TexelLUTMode (texelLUTMode) 

yuvMode. UnitEnable = TVP4 010_DI SABLE 

YUVMode (yuvMode) 

colorDDAMode .UnitEnable = TVP4 010_DISABLE 

ColorDDAMode (colorDDAMode) 

textureColorMode .UnitEnable = TVP4 010_DISABLE 
TextureColorMode (textureColorMode ) 
fogMode. UnitEnable = TVP4 010_DISABLE 

FogMode (fogMode) 

alphaBlendMode .UnitEnable = TVP4 010_DISABLE 

AlphaBlendMode (alphaBlendMode) 

logicalOpMode. UnitEnable = TVP4 010_DISABLE 

LogicalOpMode (logicalOpMode) 

statisticMode.EnableStats = TVP4 010_DISABLE 

StatisticMode (statisticMode) 
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5.5 Bypass Initialization 

The TVP4010 bypass mechanism gives direct access to memory that the 
TVP4010 uses to hold the framebuffer, localbuffer and textures. In some 
situations it is useful for an application to have direct access to this memory 
without going through the graphics processor. Initialization of PCI registers, in 
particular the Bypass Writemask register, covers initialization of the bypass 
mechanism. 

The memory configuration registers are generally set by resistors on the 
TVP4010 graphics card (this may vary between vendors but it is the 
recommended approach). The bypass writemask is undefined at boot time 
and should be set to -1. This register is at offset 0x1140 in region control 
space. 

Refer to the TVP4010 Data ManuaHor further details. 



Initialization 5-11 



Bypass Initialization 



5-12 



Chapter 6 



Programming Tips 



This chapter covers a variety of programming tips that make best use of the 
TVP401 0. The topics covered here are not exhaustive. 
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6.1 PCI Bus Issues 

6.1.1 Improving PCI bus bandwidth for Programmed I/O and DMA 

The simplest way to program the TVP4010 is by writing data values into the 
memory mapped registers, i.e. programmed I/O. This is appropriate for 
primitives which require few set-up parameters such as 2D lines. 

For more complex primitives such as Gouraud shaded triangles, where a 
significant number of registers must be loaded for each primitive, it may be 
more optimal to write directly to the TVP4010 FIFO input. 

The advantage of this mechanism is that it is then possible to use DMA burst 
transfers. The disadvantage of this method is that both the address of the 
register and the data value to be loaded must be written, apparently doubling 
the amount of data to be loaded. 

However, to improve bus bandwidth utilization, the registers have been 
grouped, into blocks which frequently need to be updated together. An indexed 
addressing mode is supported that allows a single address to be loaded, 
followed by the data for a whole set of registers. 

An additional mode is supported that allows a large number of data values to 
be loaded to the same register. This is useful for image downloads. 

For more detail, refer to section 2.3. 

6.1 .2 PCI burst transfers under Programmed I/O 

PCI bus burst transfers typically allow up to four times the bandwidth of 
individual transfers. However burst transfers are only initiated on the PCI bus 
when successive addresses are being written to (i.e. the byte address is 
incremented by 4). When using burst transfers to perform programmed I/O to 
load the TVP401 FIFOs, the TVP4010 multiply maps the FIFO input register 
throughout the range: 

0x00002000 to 0x00002FFF in region 

Thus when data is being loaded into the FIFO a software loop should be written 
which starts by writing the first data item at the lower extreme of this address 
range, and works towards the upper. For further information see section 3.2 

6.1.3 Using PCI Disconnect under Programmed I/O 

The PCI bus protocol incorporates a feature known as PCI Disconnect, which 
is supported by the TVP401 0. Once the TVP401 is in this mode, if the host 
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processor attempts to write to the full FIFO, the TVP401 chip will assert PCI 
Disconnect instead of the write being lost. This in turn causes the host 
processor to keep retrying the write cycle until it succeeds. 

This PCI disconnect feature allows faster download of data to the TVP4010 
since the host need not poll the InFIFOSpace register. It should be used with 
care because when the PCI Disconnect is asserted, the bus is effectively 
hogged by the host processor until such time as the TVP4010 frees up an 
entry in its FIFO. 

6.1.4 Using bus mastership (DMA) 

It is expected that most the TVP401 boards will support PCI bus mastership. 
This allows the on-board DMA to copy data from host memory into the 
TVP4010 FIFO. 

The use of PCI bus mastership has a number of benefits: 

□ PCI bus bandwidth utilization is generally much improved. 

□ PCI bus bandwidth is further improved because the driver software no 
longer needs to poll the FIFO flags to find how many entries are empty, 
before loading it. 

□ Overall system performance may benefit through increased parallelism 
between the TVP401 and the host, as the host can often perform useful 
work preparing the next DMA buffer once it has initiated a DMA transfer. 

See subsection 2.3.3 for more details on using DMA. 

6.1.5 Improving performance with DMA 

Using DMA interrupts can significantly improve performance as these allow 
useful work to be done in time that would otherwise be used by polling. 

Having multiple DMA buffers is usually advantageous. The size and number 
of buffers is dependent on operating system (OS) dependant issues such as 
context switch time. 
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6.2 Graphics Hyperpipeline 

6.2.1 Disable Unused Units 

Any unit that is not being used should be disabled. This will maximize pixel 
throughput in the graphics core. 

It is important to make sure that data is not being read from the texture buffer, 
localbuffer or framebuffer unless it is needed. For instance, it is possible to set 
up the localbuffer read unit such that the TVP401 reads per pixel information, 
such as Z or stencil buffer data, that is then discarded. The effect will be the 
same visually, but the cost in performance of making the memory accesses is 
very high. It is also important to set the LBDisableUpdate bit in the Window 
register if localbuffer writes are not needed. 

For optimal performance, hardware writemasks should be used in preference 
to software masks. 



6.2.2 Avoid Unnecessary Register Updates 

The TVP401 control registers maintain their state between primitives so they 
do not need to be updated unless the data needs to change. For example, the 
dY register might be set to +1 for a trapezoid and does not need to be reloaded 
until a line primitive is drawn. 

All delta values and start values are maintained across primitives, so if two 
triangles share a dominant edge, the start and dominant edge values do not 
need to be calculated or loaded twice. 

Similarly, window clipping need not reload all the registers for each clip 
rectangle. For example: Load the registers ready for a primitive to be drawn, 
then enter a loop which repeatedly loads the coordinates for a clip rectangle 
into the Scissor unit and then sends the Render command. Any number of clip 
rectangles can be processed in this way but the TVP401 requires only one 
setup for each primitive. 

6.2.3 Loading Registers in Unit Order 

To maximize performance, the control registers for the next primitive should 
be loaded into the TVP401 FIFO in unit order. Thus, the registers associated 
with the Rasterizer unit should be loaded first, then Scissor, Stipple, 
Localbuffer Read, and so on until the last unit to be loaded is the Host Out unit 
(if necessary). Then finally the relevant command register should be loaded. 

For the order of the units in the hyperpipeline, see to Figure 4-1 of Chapter 4. 
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6.2.4 Use of Continue Commands 

The continue commands provide an efficient method for drawing complex 
primitives without decomposing them into trapezoids or single lines. 

As far as context switching is concerned, each primitive should be treated as 
atomic. For example, if the TVP4010 context switched after the Render 
command for a triangle, but before it's associated ContinueNewDom 
command, the second part of the primitive may be drawn incorrectly. This is 
because the TVP4010 relies on the internal state set up by the Render 
command that would have been corrupted by any intervening context. 

A second requirement of the continue commands is that data written to the 
framebuffer or localbuffer before the continue, should not be read afterwards. 
This is not a common occurrence, but a possible situation is where two lines 
are drawn, the second joining the end of the first and being started by 
ContinueNewLine. If these lines are XORed, they will read the pixel they are 
about to write to. If the second line is at a sharp angle so that it folds back and 
overwrites some or all of the first line, the XOR operation is not correct because 
the pixels from the first line may not have been written to memory before the 
second line reads them. 

If this situation is likely to occur, a Sync command should be sent before the 
ContinueNewLine. This will ensure that all necessary writes complete before 
the corresponding reads. The software does not have to wait for the Sync to 
be read from the output FIFO; simply sending Sync is enough to ensure correct 
operation. 
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6.3 Area Filling Techniques 

This section describes the various techniques used to fill an area. 

6.3.1 Clearing Buffers Quickly 

Block writes are a feature of SGRAMs. Data written once to a single address 
can be applied to several addresses at the same time. This is a very fast way 
of filling areas of the screen, but there are restrictions on when they can be 
used which are covered elsewhere in this manual. 

Blockwrites are most obviously useful for clearing the screen, but because the 
TVP401 has a unified memory buffer it is possible to clear the localbuffer with 
blockwrites also. 

The extent checking in the host out unit can be used to indicate the area of the 
screen that has been written, so the screen clear can be limited to the minimum 
area necessary. 



6.3.2 Avoid Clearing Buffers 

Although block writes can be used for fast clearing of buffers, it is best not to 
clear them at all. If all pixels on the screen are drawn at least once per frame 
then the framebuffer does not need to be cleared. There is no need to clear 
the localbuffer either if the following procedure is followed. 

For even frames, put the viewer at a depth position of zero and draw objects 
in the lower half of the depth range with the depth test set to less than. For odd 
frames, put the viewer at the maximum depth value and draw objects into the 
upper half of the depth range with the depth test set to greater than. 

This loses half of the depth range, but avoids the need to clear the depth buffer 
if every pixel is touched at least once. 

6.3.3 Trapezoid Fills 

Blockwrites are most useful when clearing the framebuffer, but can be used 
to fill any trapezoid. 

Block fills, however, are limited to the area defined by the Rasterizer and 
cannot be changed by the stipple test. A quick filling technique that permits 
these tests can be achieved by setting the UseConstantFBWriteData bit in the 
Logic Op unit. When this bit is set, the required color should be loaded into the 
FBWriteData register in the format needed by the memory. All unrequired units 
should be disabled and the Rasterizer started. The fill can be done up to twice 
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as quickly using this method as opposed to the ConstantColor register 
method. 

Also remember that even though the display may be 8 bits per pixel, the chip 
can be told to draw at 32 bits per pixel. When this is done four pixels are plotted 
at one time, but the width of the region the Rasterizer covers should be 
reduced by a factor of four. Use the technique described in the tip about packed 
copies to get the Framebuffer Write Unit to calculate addresses correctly for 
32 bit pixels. The PackedDataLimits register can also be used to mask out 
unwanted pixels on the left and right edge. 
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6.4 Copies and Downloads 

6.4.1 Copies 

If the pixel size is 8 or 16 bits per pixel, the copy speed can be improved by 
moving more than one pixel at a time. This is achieved by setting the 
PackedCopy bit in the Framebuffer Read unit. This bit tells the TVP401 that 
it should pretend that the pixel size is 32 bits and calculate the addresses 
accordingly. The screen width does not need to be changed, nor does the base 
address or source offset value. The Rasterizer should be programmed to 
rasterize a rectangle that is a factor of four narrower (for 8-bit pixels) or a factor 
of two narrower (for 1 6 bit pixels) than the normal size. 

The groups of four or two pixels that are copied are all aligned to a 32-bit 
boundary, but if some of the edge pixels are not needed, the PackedDataLimits 
register can be used to mask them out. If the source and destination pixels 
have a different alignment, the RelativeOffset field in the FBReadMode 
register can be used to specify how the source needs to be shifted to line up 
with the destination. 

6.4.2 Downloads 

The same registers described in the previous tip can also be used to pack data 
during a download to the framebuffer or localbuffer. If the Rasterizer is set to 
sync on FBData, the data sent to the TVP4010 must be in the raw memory 
format. Four 8-bit pixels can be written at one time to the chip, and the 
PackedDataLimits register set to mask any unwanted pixels at the left and right 
edges; the RelativeOffset field is used to shift the alignment of the data as it 
is being stored. 

Downloads to the localbuffer can use LBData, but the Rasterizer does not 
support sync on LBData, so the data must be explicitly synchronized using the 
Sync command. Alternatively, downloads of stencil and/or depth data can be 
performed through the framebuffer write unit, allowing WaitForCompletion or 
sync on FBData to be used. 

6.4.3 Loading Textures 

The TVP4010 handles internal synchronization so that all necessary writes 
complete before reads for a given buffer. If the same data is treated as two 
different types, the chip must be explicitly synchronized. When a texture is 
downloaded, it is written to memory through the framebuffer write unit, but it 
is read through the Texture Read unit. This means that the chip must be 
synchronized between loading the texture and reading it; otherwise, there is 
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no insurance that the writes will have completed before the reads begin. A 
Sync command can be used to do this, or a WaitForCompletion command 
which does not require the polling of the output FIFO. 

Similarly, if the Framebuffer Write unit is used to clear the localbuffer, or the 
Texture Read unit is used in a copy operation, the chip must be synchronized. 
The chip will synchronize between localbuffer read and localbuffer write, and 
between framebuffer read and framebuffer write. Any operations that mix 
buffers need synchronization. 

If a texture is downloaded as a normal image, it can make use of the formatting 
in the chip to change color format and reorganize the data into rectangular 
patches. If texture is already in the required format, a fast texture download can 
be used. To use this, set the TextureDownloadOffset register to point to the 
start address of the texture (in 32-bit words). Write 32-bit texture data to the 
TextureData register and this will be written to memory without changing 
format. The TextureDownloadOffset will automatically increment following 
each write. If the texture is 8 bits per texel, then four texels must be supplied 
at a time. This method of texture download avoids the need to set-up the 
Rasterizer for image download and allows the state of the chip to be left 
unchanged. Even the framebuffer writes do not have to be enabled. 
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6.5 Multi Buffering 

6.5.1 Fast Double Buffering 

The TVP401 board designs can readily support a variety of double buffering 
mechanisms depending on the memory configuration and LUT-DAC used, 
including: 

□ BLT 

□ Full Screen 

□ Bitplane 

For further details see sections 3.4, 4.12.8, 4.12.9 and 4.13 of this user's 
guide. 

Note that optimal functionality may be achieved by mixing two or more of the 
above double buffering techniques. 

As a general performance note, it is best to send non-framebuffer related 
commands to the TVP401 following a SuspendUntilFrameBlank command. 
For example, any commands to clear the depth buffer between frames should 
be sent as these will not affect the framebuffer and will be executed while the 
TVP401 waits for the VBLANK. This allows better overlap between the host 
and the TVP401 0. In general any commands that will not cause rendering to 
the framebuffer to occur can be queued in the TVP4010 FIFO before waiting 
on VBLANK. 

6.5.2 Triple Buffering 

Most 3D systems support double buffering where one frame is displayed while 
the next frame is being drawn. To avoid display artefacts, the change between 
old and new buffers must happen during a vertical frame blank, but this 
imposes a granularity on the frame rate. If a scene takes slightly longer than 
one frame period to draw, it has to wait for another frame before it can display, 
so the frame rate halves. 

If three buffers are used, the quantization is removed and the system can 
continue to draw at maximum rate. 
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6.6 Overlays 

Overlay planes are useful for window systems and for games that move sprites 
across a static background. The TVP4010 does not have direct support, but 
if it is used with a RAMDAC such as the IBM526DB then overlays are possible. 

Overlays are only available with the 5:5:5:1 color format in a 32-bit pixel. The 
TVP401 5551 color formats copy the data into both 1 6-bit halves of the 32-bit 
pixel. The writemask is used to write either the upper or lower half to memory. 

The RAMDAC can be programmed to display a 16-bit pixel from either the 
upper or lower half of the 32-bit word; which one is displayed is set by bit 31 . 
Bit 31 corresponds to the alpha bit of the 1 6-bit pixel, and this can be forced 
to either 1 or by the Color Format unit. 

When drawing to the underlay (or main image), set the Color Format unit to 
force the alpha to zero, set the writemask to allow writes to the lower half of 
the word. When drawing to the overlay, set the Color Format unit to force the 
alpha value to 1 and writemask to allow writes to the upper half of the word. 

If the RAMDAC is set into the appropriate mode, pixels in the overlay half of 
the word will be drawn where alpha is one in the overlay and from the main 
image where it is zero in the overlay. 
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6.7 Memory Organization 

The amount of memory available to the TVP401 depends on the board it is 
fitted to. The most efficient way to allocate memory will depend on the needs 
of the system, but in general the display should be allocated at one end of the 
SGRAM and the localbuffer at the other end. This leaves a region between the 
two buffers in which textures can be stored. For optimal performance, each 
buffer (front color, back color, texture and depth) should reside in separate 
memory banks. Memory is organized as shown in Table 6-1 . 

Table 6-1. Memory Organization 



Memory size 


Banks 


Size per bank 


2Mb 


2 


1Mb 


4Mb 


4 


1Mb 


6Mb 


4 


1 or 2Mb 


8Mb 


4 


2Mb 



With 6Mb of memory, the first two banks will contain 1 Mb and the subsequent 
two, 2Mb. 
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6.8 Chroma Test 

Chroma key testing can be done without involving texture mapping. This is 
achieved by setting the TexelDisableUpdate field in the YUVMode register. 
This allows fragments to be rejected by chroma testing as part of a copy 
operation. The texels are read in and tested, and fragments rejected if the 
colors do not match. Setting the TexelDisableUpdate bit discards the data as 
soon as the test has been done which improves performance. 

This is described in more detail in subsection 4.9.1 
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This chapter gives details of the format of each of the Graphics registers for 
the TVP4010. The registers are listed alphabetically by name within their 
function, with the functions themselves listed alphabetically. 

□ Tag specifies the offset for this register from the base address of the 
region. 

□ Read/write indicates that the register can be both read and written. 

□ Write indicates that the register can only be written. The value of any read 
from this address is undefined. 

□ Reset Value specifies the value of the register following hardware reset. 
In general this is undefined for Graphics registers. 

In the diagrams: 

□ Reserved indicates bits that may be used in future members of the 
TVP4010 family. To ensure upwards compatibility, any software should 
not assume a value for these bits when read, and should always write 
them as zeros. 

□ Not used indicates bits that are adjacent to numeric fields. These may be 
used in future members of the TVP4010 family, but only to extend the 
dynamic range of these fields. The data returned from a read of these bits 
is undefined. When a Not Used field resides in the most significant 
position, a good convention to follow is to sign extend the numeric value, 
rather than masking the field to zero before writing the register. This will 
ensure compatibility if the dynamic range is increased in future members 
of the TVP4010 family. 

□ For enumeration fields which do not specify the full range of possible 
values, only the specified values should be used. An example of an 
enumeration field is the comparison field in the DepthMode register. 
Future members of the TVP4010 family may define a meaning for the 
unused values. 
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AlphaBlendMode 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



AlphaBlend Mode 
Texture/Fog/Blend 
0x0102 
Undefined 



31 



24 



16 



■ i i 



Reserved 



_l I I u 



ColorFormatExtension 

Reserved 



ColorFormat 
— i — i — i — i 



Operation 
-i — i — i — i — i — i— 



BlendType 
ColorOrder 
NoAlphaBuffer 



Enable 



Controls Alpha Blending. 

BitO Enable: 



= Disable 

1 = Enable alpha blending or 

color formatting 



Bit1-7 Operation: 



Mode 


Operation 


R 


G 


B 


A 


16 


Format 




Gd 


B d 




84 


Blend 


R S .A S + R d .(1-A s ) 


G S .A S + G d .(1-A s ) 


B S *A S + B d .(1-A s ) 


A S .A S + A d .(1-A s ) 


81 


PreMult 


R s + R d .(1-A s ) 


G s + G d .(1-A s ) 


B s + B d .(1-A s ) 


A s + A d .(1-A s ) 



Result of different operations. C s = source color component, C,j = destination 
color component. 



(See overleaf for description of the remaining bits). 
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Bit8— 11 Color Format: 

Internal Color Channel 



Format 1 


Color Order 


Name 


R 


G 


B 


A 





BGR 


8:8:8:8 


8@0 


8@8 


8@16 


8@24 


1 


BGR 


5:5:5:1 Front 


5@0 


5@5 


5(3)10 


1@15 


2 


BGR 


4:4:4:4 


4@0 


4@4 


4@8 


4(3)12 


5 


BGR 


3:3:2 Front 


3@0 


3@3 


2@6 





6 


BGR 


3:3:2 Back 


3@8 


3@11 


2@14 





9 


BGR 


2:3:2:1 Front 


2@0 


3@2 


2@5 


1@7 


10 


BGR 


2:3:2:1 Back 


2@8 


3@10 


2(3)13 


1@15 


11 


BGR 


2:3:2 FrontOff 


2@0 


3@2 


2@5 





12 


BGR 


2:3:2 BackOff 


2@8 


3@10 


2(3)13 





13 


BGR 


5:5:5:1 Back 


5@16 


5@21 


5(3)26 


1@31 


14 


BGR 


CI8 


8@0 











16 


BGR 


5:6:5 Front 


5@0 


6@5 


5(3)11 





17 


BGR 


5:6:5 Back 


5@16 


6@21 


5(3)27 








RGB 


8:8:8:8 


8@16 


8@8 


8(3)0 


8@24 


1 


RGB 


5:5:5:1 Front 


5@10 


5@5 


5(3)0 


1@15 


2 


RGB 


4:4:4:4 


4@8 


4@4 


4(3)0 


4@12 


5 


RGB 


3:3:2 Front 


3@5 


3@2 


2(3)0 





6 


RGB 


3:3:2 Back 


3@13 


3@10 


2(3)8 





9 


RGB 


2:3:2:1 Front 


2@5 


3@2 


2(3)0 


1@7 


10 


RGB 


2:3:2:1 Back 


2@13 


3@10 


2(3)8 


1@15 


11 


RGB 


2:3:2 FrontOff 


2@5 


3@2 


2(3)0 





12 


RGB 


2:3:2 BackOff 


2@13 


3@10 


2(3)8 





13 


RGB 


5:5:5:1 Back 


5@26 


5(3)21 


5(3)16 


1@31 


14 


RGB 


CI8 


8@0 











16 


RGB 


5:6:5 Front 


5(5)11 


6@5 


5(3)0 





17 


RGB 


5:6:5 Back 


5@27 


6@21 


5(3)16 






1) The format column is also dependant on bit16. n@m means n bits starting at bit m. Front and Back modes 
replicate the color value to assist with double buffering. CI values are replicated into each byte to assist with 
double buffering. Offset modes have 64 added to the 7 bit formatted value. If the format has no alpha bits, the 
alpha field defaults to 0xF8 



Bit1 2 NoAlphaBuffer 

= Alpha buffer present 

1 = No alpha buffer present 



7-4 



AlphaBlendMode mnemonic 



Bit13 ColorOrder: 

= BGR 

1 = RGB 

Bit14 BlendType: 

= RGB 

1 = Ramp 

Bit1 6 Color Format Extension. Most significant bit 

extension to Color Format held in bits 8-11 . 



Graphics Register Reference 7-5 



mnemonic AreaStippleMode 



AreaStippleMode 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Area Stipple Mode 
Scissor/Stipple 
0x0034 
Undefined 



31 



24 



16 



i I i i i r 

Reserved 



_i — i — 1_ 



YOffset 



XOffset 



Reserved 



ForceBackgroundColor 
MirrorY 
MirrorX 
Invert Stipple Pattern 



Not used 



Not used 



Enable Unit 



Controls Area Stippling. Both the AreaStippleEnable bit in the Render command and the 
enable in the AreaStippleMode register must be set to enable the area stipple test. 



BitO 

Bit7-9 
Bit1 2-1 4 
Bit17 

Bit1 8 



Unit Enable 



XOffset 



YOffset 



= Disable 

1 = Enable 



Invert Stipple Pattern 

= No Invert 

1 = Invert 



Mirror X 



= No Mirror in X 

1 = Mirror stipple pattern 
in X direction 



7-6 



AreaStippieMode mnemonic 



Bit19 Mirror Y 

= No Mirror in Y 

1 = Mirror stipple pattern 
in Y direction 

Bit20 ForceBackgroundColor. Controls operation of the 

stipple test. If disabled any fragment failing the test 
is discarded. If enabled any fragment failing the 
test is drawn (other tests allowing) but the color is 
taken from the TexelO register. Used to support 
foreground and background colors. 

= Disable 

1 = Enable 



Graphics Register Reference 7-7 



mnemonic AreaStipplePatternfO... 7) 



AreaStipplePattern(0...7) 



Name: 



Area Stipple Pattern 



Unit: 



Scissor/Stipple 



Tag: 



0x0040, ...,0x0047 



Reset Value 



Undefined 



Read/write 



31 



24 



16 



8 







Reserved 



8 bit mask 



These eight registers provide the bitmask which enables and disables 
corresponding fragments for drawing when rasterizing a primitive with area 
stippling. 

Both the AreaStippleEnable in the Render command and enable in the 
AreaStippleMode register must be set, to enable the area stipple test. 



7-8 



AStart mnemonic 



AStart 

Name: Initial Alpha Color 

Unit: Color DDA 

Tag: 0x00 F9 

Reset Value: Undefined 
Read/write 



31 




24 




16 




8 







■ i i i i i i 
Not used 




Integer 


i i i 

1 ■ L 


1 1 1 1 1 

Fraction 

! 1 1 1 1 


i ■ 

i i 


i i i i i i 
Not used 



L Sign 



This register is used to set the initial value for the Alpha for a vertex when in 
Gouraud shading mode. The value is 2s-complement 6.11 fixed-point format. 



Graphics Register Reference 7-9 



mnemonic BitMaskPattern 



BitMaskPattern 

Name: Bit Mask Pattern 

Unit: Rasterizer 
Tag: OxOOO D 

Reset Value: Undefined 
Write only 

31 24 16 8 

1 1 i 1 i i i 1 1 1 1 1 1 1 1 ■ 1 1 1 1 1 1 1 ■ 1 1 1 1 1 1 1 

32 bit mask 



Value used to control the bit mask stipple operation (if enabled). Fragments are 
accepted or rejected based on the current BitMask test modes defined by the 
RasterizerMode register. Note that the SyncOnBitmask bit in the Render 
command must also be enabled. 



7-10 



BStart mnemonic 



BStart 

Name: Initial Blue Color 

Unit: Color DDA 

Tag: 0x00 F6 

Reset Value: Undefined 
Read/write 



31 




24 




16 




8 







■ i i i i i i 
Not used 




Integer 


i i i 

1 ■ L 


1 1 1 1 1 

Fraction 

! 1 1 1 1 


i ■ 

i i 


i i i i i i 
Not used 



L Sign 



This register is used to set the initial value for the Blue for a vertex when in 
Gouraud shading mode. The value is 2s-complement 6.11 fixed-point format. 



Graphics Register Reference 7-11 



mnemonic ChromaLowerBound, Chroma UpperBound 



ChromaLowerBound,ChromaUpperBound 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Chroma Lower Bound, Chroma Upper Bound 
YUV 

0x01 E2, 0x01 E1 
Undefined 



31 




24 




16 




8 







1 1 1 1 1 1 1 

Alpha 


Blue 


Green 


i i i i i i i 
Red 


31 




24 




16 




8 







■ i i i i i i 
Alpha 


V 


U 


Y 



Specifies the lower and upper bounds for the chroma test. The test is done 
against the contents of the TexelO register which holds data in the internal RGB 
format of 5 bits integer and 3 bits fraction, or the YUV format (before 
conversion) of 8 bits per component. The test is done on all 8 bits of each 
component. All components must be inside the bounds for the test to pass, if 
TestMode is set to 1 in the YUVMode register, or fail if TestMode is set to 2 in 
the YUVMode register. 



7-12 



Color mnemonic 



Color 

Name: Color 

Unit: Color DDA 

Tag: 0x00 FE 

Reset Value: Undefined 
Write 



31 




24 




16 




8 







■ i i i i i i 
Alpha 


i i i i i i i 
Blue 


i i i i i i i 
Green 


i i i i i i i 
Red 



Used for downloading image data to the framebuffer. The format is either the 
standard color format, or the raw framebuffer format if the Color Format unit 
is disabled. 

The internal color format will interpret the 8 bit fields as either 5.3 fixed-point 
for 3D operations or 8 bit integer for 2D operations. In CI mode the color index 
is placed in bits 0-7. If there are less than 8 bits in a component it should be 
left justified and the unused bits set to zero. 

This register cannot be saved and restored as part of a task context switch. 

When used this register should always be reloaded at start of every command, 
and the Color DDA unit must be disabled prior to loading it. 



Graphics Register Reference 7-13 



mnemonic ColorDDAMode 



ColorDDAMode 



Name: Color DDA Mode 

Unit: Color DDA 

Tag: 0x00 FC 
Reset Value: Undefined 
Read/write 

31 24 16 



i — i — i — i 1 — i 1 — i — i — i ■ — i r 

Reserved 



_i — i — i — 1_ 



Shading Mode Unit Enable 

The bit fields control the mode of operation of the Color DDA unit: 
BitO Unit Enable: 

= Disable 

1 = Enable 

Bit1 Shading mode control: 

= Flat 

1 = Gouraud 



7-14 



ConstantCoior mnemonic 



ConstantColor 

Name: ConstantColor 

Unit: Color DDA 

Tag: 0x00 FD 

Reset Value: Undefined 
Read/write 



31 




24 


16 






8 









1 1 1 1 1 1 1 

Alpha 


i i i i i i i 
Blue 


i i i i i i i 
Green 


i i i i i i i 
Red 


31 




24 


16 






8 









i 


■ i i 


i ■ i i i i i i i ■ i i 

32 bit value 

1 I 1 1 1 1 1 1 1 ■ 1 !— 


1 1 1 u 


i 

i i 


i — i — 
i i 




i 



Holds the constant color in either RGBA or raw framebuffer format. This value 
is used when the ColorDDAMode register is set to flat shading mode. 

The internal color format will interpret the 8 bit fields as either 5.3 fixed-point 
for 3D operations or 8 bit integer for 2D operations. In CI mode the color index 
is placed in bits 0-7. If a component has less than 8 bits, it should be left 
justified and the unused bits set to zero. 



Graphics Register Reference 7-15 



mnemonic Continue 



Continue 



Name: 



Continue 



Unit: 



Rasterizer 



Tag: 



OxOOOB 



Reset Value: 



Undefined 



Write 



31 



24 



16 



8 







Reserved 



11 bit unsigned integer 



This command causes rasterization to continue after new delta value(s) have 
been loaded, but does not cause either of the trapezoid's edge DDAs to be 
reloaded. 

The data field holds the number of scanlines to fill. Note this count does not 
get loaded into the Count register. 



7-16 



ContinueNewDorm mnemonic 



ContinueNewDom 



Name: 



Continue - New Dominant Edge 



Unit: 



Rasterizer 



Tag: 



0x0009 



Reset Value 



Undefined 



Write 



31 



24 



16 



8 







Reserved 



11 bit unsigned integer 



This command causes rasterization to continue with a new dominant edge. 
The dominant edge DDA is reloaded with the new parameters. The 
subordinate edge is carried on from the previous trapezoid. This allows any 
convex polygon to be broken down into a collection of trapezoids and 
continuity maintained across boundaries. 

Since this command only affects the Rasterizer DDA (and not of any other 
units), it is not suitable for 3D operations. 

The data field holds the number of scanlines to fill. Note that this count does 
not get loaded into the Count register. 



Graphics Register Reference 



7-17 



mnemonic ContinueNewLine 



ContinueNewLine 



Name: 



Continue - New Line Segment 



Unit: 



Rasterizer 



Tag: 



0x0008 



Reset Value 



Undefined 



Write 



31 



24 



16 



8 







Reserved 



11 bit unsigned integer 



This command causes rasterization to continue for the next segment in a 
polyline. The XY position is carried on from the previous line, however the 
fraction bits in the DDAs can be kept, set to zero, one half, or nearly one half, 
under control of the RasterizerMode register. 

The data field holds the number of pixels in a line. Note this count does not get 
loaded into the Count register. 

The use of ContinueNewLine is not recommended for OpenGL because the 
DDA units will start with a slight error as compared with the value they would 
have been loaded with for the second and subsequent segments. 



7-18 



ContinueNewSub mnemonic 



ContinueNewSub 



Name: 



Continue - New SubordinateEdge 



Unit: 



Rasterizer 



Tag: 



OxOOOA 



Reset Value 



Undefined 



Write 



31 



24 



16 



8 







Reserved 



11 bit unsigned integer 



This command causes rasterization to continue with a new subordinate edge. 
The subordinate DDA is reloaded with the new parameters. The dominant 
edge is carried on from the previous trapezoid. This is very useful when scan 
converting triangles with a knee (i.e. two subordinate edges). 

The data field holds the number of scanlines to fill. Note this count does not 
get loaded into the Count register. 



Graphics Register Reference 



7-19 



mnemonic Count 



Count 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Count 
Rasterizer 
0x0006 
Undefined 



31 



24 



16 



T 



~ i — ■ — i 1 1 — i 1 1 — r 

11 bit unsigned integer 



_i i i i_ 



Reserved 



Interpretation of contents is dependent on the mode set in the Render 
command i.e. it specifies the number of pixels in a line, or the number of 
scanlines in a trapezoid. 



7-20 



dBdx mnemonic 



dBdx 

Name: X Derivative - Blue 

Unit: Color DDA 

Tag: 0x00 F7 

Reset Value: Undefined 
Read/write 



31 




24 




16 




8 







■ i i i i i i 
Not used 




Integer 


i i i 

1 ■ L 


1 1 1 1 1 

Fraction 

! 1 1 1 1 


i ■ 

i i 


i i i i i i 
Not used 



L Sign 



This register is used to set the X derivative for the Blue value for the interior 
of a trapezoid when Gouraud shading. The value is in 2s-complement 6.11 
fixed-point format. 



Graphics Register Reference 7-21 



mnemonic dBdyDom 



dBdyDom 

Name: Y Derivative Dominant - Blue 

Unit: Color DDA 

Tag: 0x00 F8 

Reset Value: Undefined 

Read/write 



31 




24 




16 




8 







■ i i i i i i 
Not used 




i i i i 
Integer 


i ■ i 

I 1 I L 


1 1 1 1 1 

Fraction 


i ■ 

i i 


i i i i i i 
Not used 



L Sign 



This register is used to set the Y derivative dominant, for the Blue value along 
aline, or for the dominant edge of a trapezoid, when in Gouraud shading mode. 
The value is in 2s-complement 6.11 fixed-point format. 



7-22 



Depth mnemonic 



Depth 

Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Depth 

Stencil/Depth 

0x0135 

Undefined 



31 



24 



16 



"i i i 1 1 

Not used 



— i — ■ — i 1 r 

Depth value 



31 24 16 8 



I I I I I II I 

Not used 


III! 


II ■ III I 
Depth value 









Holds an externally sourced 16 or 15 bit depth value. The unused most 
significant bits should be set to zero. 

This is used in the draw pixels function where the host supplies the depth 
values through the Depth register. 

Alternatively this is used when a constant depth value is needed, for example, 
when clearing the depth buffer, or for 2D rendering where the depth is held 
constant. 



Graphics Register Reference 7-23 



mnemonic DepthMode 



Depth Mode 

Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Depth Mode 
Stencil/Depth 
0x0134 
Undefined 



31 24 16 



- — i — i i — i — i — i 1 — i 1 — i — r 

Reserved 



_i i i i i i — ■ — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i— 



Compare Mode New Depth Source 



Write Mask 
Unit enable 



Controls the comparison of a fragment depth value and updating of the depth 
buffer. If the compare function is LESS and the result is true then the fragment 
value is less than the source value. 

BitO Unit Enable: 

= Disable 

1 = Enable 

Bit1 Writemask: 

= Disable write to depth buffer 

1 = Enable write to depth buffer 

Bit2-3 Source of depth value for comparison: 

= Fragment's depth value 

1 = LBData - 

for copy pixels when destination 
depth planes are not updated. 

2 = Depth register 

3 = LBSourceData- 

for copy pixels when destination 
depth planes are updated. 



7-24 



Running Title — Attribute Reference mnemonic 



Bit4-6 Comparison function: 



= 


NEVER 


1 = 


LESS 


2 = 


EQUAL 


3 = 


LESS OR EQUAL 


4 = 


GREATER 


5 = 


NOT EQUAL 


6 = 


GREATER OR EQUAL 


7 = 


ALWAYS 



Chapter Title— Attribute Reference 



7-25 



mnemonic dFdx 



dFdx 

Name: X Derivative - Fog 

Unit: Texture/Fog/Blend 

Tag: 0x00 D5 

Reset Value: Undefined 
Read/write 



31 24 16 8 



1 1 III 111 






i i i i i ■ i i i i i 




i i i i i 


Not used 






Fraction 




Not used 



Sign Integer 

Fog coefficient derivative per unit X for use in rendering trapezoids. The value 
is in 2s-complement 2.1 6 fixed-point format. 



7-26 



dFdyDom mnemonic 



dFdyDom 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Y Derivative Dominant - Fog 

Texture/Fog/Blend 

0x00D6 

Undefined 



31 



24 



16 



T 1 1 1 - 

Fraction 



-i — i — i 1 r 

Not used 



■ i i 



Not used 



I I 
Sign Integer 



Fog coefficient derivative per unit Y along a line, or for the dominant edge of 
a trapezoid. The value is in 2s-complement 2.1 6 fixed-point format. 



Graphics Register Reference 7-27 



mnemonic dGdx 



dGdx 

Name: X Derivative - Green 

Unit: Color DDA 

Tag: 0x00 F4 

Reset Value: Undefined 
Read/write 



31 




24 




16 




8 







■ i i i i i i 
Not used 




i i i i 
Integer 


i ■ i 

I 1 I L 


1 1 1 1 1 

Fraction 


i ■ 

i i 


i i i i i i 
Not used 



L Sign 



This register is used to set the X derivative for the Green value for the interior 
of a trapezoid when Gouraud shading. The value is in 2s-complement 6.11 
fixed-point format. 



7-28 



dGdyDom mnemonic 



dGdyDom 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Y Derivative Dominant - Green 

Color DDA 

0x00F5 

Undefined 



31 




24 




16 




8 







■ i i i i i i 
Not used 




Integer 


i i i 

1 ■ L 


1 1 1 1 1 

Fraction 

! 1 1 1 1 


i ■ 

i i 


i i i i i i 
Not used 



L Sign 



This register is used to set the Y derivative dominant, for the Green value along 
aline, or for the dominant edge of a trapezoid, when in Gouraud shading mode. 
The value is in 2s-complement 6.11 fixed-point format. 



Graphics Register Reference 7-29 



mnemonic DitherMode 



DitherMode 



31 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Dither Mode 
Color Format 
0x0103 
Undefined 



24 



16 



1 1 i 1 i i i ■ 1 1 1 1 1 1 

Reserved 




i 


i i 






i 


i 


i i i i 







X offset / Dither enable 
Y offset / Unit enable 



Color format extension / ForceAlpha 

Reserved DitherMethod 

Color order Color format 

Controls the Color Format unit. 
BitO Unit Enable: 

= Disable 

1 = Enable 



Bit1 



Dither Enable: 



= Disable 

1 = Enable 



See the register description above for the remaining bits. 



7-30 



DitherMode mnemonic 



Bit2-5 Color Format: 



Internal Color Channel 



Format 2 


Color Order 


Name 


R 


G 


B 


A 





BGR 


8:8:8:8 


8@0 


8@8 


8@16 


8@24 


1 


BGR 


5:5:5:1 Front 


5@0 


5@5 


5@10 


1@15 


2 


BGR 


4:4:4:4 


4@0 


4@4 


4@8 


4@12 


5 


BGR 


3:3:2 Front 


3@0 


3@3 


2@6 





6 


BGR 


3:3:2 Back 


3@8 


3@11 


2@14 





9 


BGR 


2:3:2:1 Front 


2@0 


3@2 


2@5 


1@7 


10 


BGR 


2:3:2:1 Back 


2@8 


3@10 


2@13 


1@15 


11 


BGR 


2:3:2 FrontOff 


2@0 


3@2 


2@5 





12 


BGR 


2:3:2 BackOff 


2@8 


3@10 


2@13 





13 


BGR 


5:5:5:1 Back 


5@16 


5@21 


5@26 


1@31 


14 


BGR 


CI8 


8@0 











16 


BGR 


5:6:5 Front 


5@0 


6@5 


5@11 





17 


BGR 


5:6:5 Back 


5@16 


6@21 


5@27 








RGB 


8:8:8:8 


8@16 


8@8 


8@0 


8@24 


1 


RGB 


5:5:5:1 Front 


5@10 


5@5 


5@0 


1@15 


2 


RGB 


4:4:4:4 


4@8 


4@4 


4@0 


4@12 


5 


RGB 


3:3:2 Front 


3@5 


3@2 


2@0 





6 


RGB 


3:3:2 Back 


3@13 


3@10 


2@8 





9 


RGB 


2:3:2:1 Front 


2@5 


3@2 


2@0 


1@7 


10 


RGB 


2:3:2:1 Back 


2@13 


3@10 


2@8 


1@15 


11 


RGB 


2:3:2 FrontOff 


2@5 


3@2 


2@0 





12 


RGB 


2:3:2 BackOff 


2@13 


3@10 


2@8 





13 


RGB 


5:5:5:1 Back 


5@26 


5@21 


5@16 


1@31 


14 


RGB 


CI8 


8@0 











16 


RGB 


5:6:5 Front 


5@11 


6@5 


5@0 





17 


RGB 


5:6:5 Back 


5@27 


6@21 


5@16 






2) The format column is also dependant on bit16. n@m means n bits starting at bit m. Front and Back modes 
replicate the color value to assist with double buffering. CI values are replicated into each byte to assist with 
double buffering. Offset modes have 64 added to the 7 bit formatted value. If the format has no alpha bits, the 
alpha field defaults to 0xF8 



Bit6-7 XOffset to enable window relative dithering. 

Bit8-9 YOffset to enable window relative dithering. 



Graphics Register Reference 7-31 



mnemonic DitherMode 



Bit10 



Bit11 



Bit1 2—1 3 



Color Order: 



Dither Method: 



ForceAlpha: 




1 




1 




1 
2 



BGR 
RGB 

Ordered 
Line 

Disable 
Force to 
Force to 0xF8 



Bit1 6 



Color Format Extension. Most significant bit 
extension to Color Format held in bitsO-3 



7-32 



dKddx mnemonic 



dKddx 

Name: X Derivative - Kd 

Unit: Texture/Fog/Blend 

Tag: 0x00 DD 

Reset Value: Undefined 
Read/write 



31 24 16 8 



1 1 1 1 1 1 1 






i i i i i ■ i i i i i 


i i ■ i 


i i i i i 


Not used 






Fraction 




Not used 



Sign Integer 



Diffuse light coefficient derivative per unit X for use in rendering texture 
mapped trapezoids using ramp application mode. The value is in 
2s-complement 2.1 6 fixed-point format. 



Graphics Register Reference 7-33 



mnemonic dKddyDom 



dKddyDom 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Y Derivative Dominant - Kd 

Texture/Fog/Blend 

OxOODE 

Undefined 



31 



24 



16 



- 1 1 — i — r 

Fraction 



— i 1 1 — r 

Not used 



■ i i 



Not used 



I I 

Sign Integer 



Diffuse light coefficient derivative per unit Y along a line, or for the dominant 
edge of a trapezoid, for use with ramp texture application mode. The value is 
in 2s-complement 2.16 fixed-point format. 



7-34 



dKsdx mnemonic 



dKsdx 

Name: X Derivative - Ks 

Unit: Texture/Fog/Blend 

Tag: 0x00 DA 

Reset Value: Undefined 
Read/write 



31 24 16 8 



1 1 1 1 1 1 1 






i i i i i ■ i i i i i 


i i ■ i 


i i i i i 


Not used 






Fraction 




Not used 



Sign Integer 



Specular light coefficient derivative per unit X for use in rendering texture 
mapped trapezoids using ramp application mode. The value is in 2s-comple- 
ment 2.16 fixed-point format. 



Graphics Register Reference 7-35 



mnemonic dKsdyDom 



dKsdyDom 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Y Derivative Dominant - Ks 

Texture/Fog/Blend 

OxOODB 

Undefined 



31 



24 



16 



- 1 1 — i — r 

Fraction 



— i 1 1 — r 

Not used 



■ i i 



Not used 



I I 

Sign Integer 



Specular light coefficient derivative per unit Y along a line, or for the dominant 
edge of a trapezoid, for use with ramp texture application mode. The value is 
in 2s-complement 2.16 fixed-point format. 



7-36 



dQdx mnemonic 



dQdx 

Name: X Derivative - Homogeneous texture coordinate 

Unit: Texture Address 

Tag: 0x0078 

Reset Value: Undefined 

Read/write 



31 


24 


16 


8 









1 1 1 1 1 I 1 1 1 


i i i i ■ i i i i 
Fraction 


i i i ■ i i i i 


i i 
Reserved 















I — Integer 

Sign 



Used to set the X derivative for the Q coordinate when texture mapping. 
Format is 2s-complement 2.27 fixed-point. 



Graphics Register Reference 7-37 



mnemonic dQdyDom 



dQdyDom 

Name: Y Derivative Dominant - Homogeneous texture coordinate 

Unit: Texture Address 

Tag: 0x0079 

Reset Value: Undefined 

Read/write 



31 


24 


16 


8 









1 1 1 1 1 I 1 1 1 


i i i i ■ i i i i 
Fraction 




i i 
Reserved 















I — Integer 

Sign 



Used to set the Y dominant derivative for the Q coordinate when texture 
mapping. Format is 2s-complement 2.27 fixed-point. 



7-38 



dRdx mnemonic 



dRdx 

Name: X Derivative - Red 

Unit: Color DDA 

Tag: 0x00 F1 

Reset Value: Undefined 
Read/write 



31 




24 




16 




8 







■ i i i i i i 
Not used 




i i i i 
Integer 


i — i ■ — r 


Fraction 


— i — ■ — t 


Not used 



L Sign 



This register is used to set the X derivative for the Red value for the interior of 
a trapezoid when Gouraud shading. The value is in 2s-complement 6.11 fixed- 
point format. 



Graphics Register Reference 7-39 



mnemonic dRdyDom 



dRdyDom 

Name: Y Derivative Dominant - Red 

Unit: Color DDA 

Tag: 0x00 F2 

Reset Value: Undefined 

Read/write 



31 




24 




16 




8 







■ i i i i i i 
Not used 




i i i i 
Integer 


i ■ i 

I 1 I L 


1 1 1 1 1 

Fraction 


i ■ 

i i 


i i i i i i 
Not used 



L Sign 



This register is used to set the Y derivative dominant, for the Red value along 
aline, or for the dominant edge of a trapezoid, when in Gouraud shading mode. 
The value is in 2s-complement 6.11 fixed-point format. 



7-40 



dSdx mnemonic 



dSdx 



Name: X Derivative - Texture S coordinate 

Unit: Texture Address 

Tag: 0x0072 

Reset Value: Undefined 

Read/write 



31 


24 


16 




8 







i i i i i i ■ i i i 


i i i ■ i 


i i i i i 


i ■ i i i i i 


i 




Integer 




Fraction 



















Sign 



Reserved 



Used to set the X derivative for the S coordinate when texture mapping. Format 
is 2s-complement 12.18 fixed-point. 



Graphics Register Reference 7-41 



mnemonic dSdyDom 



dSdyDom 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Y Derivative Dominant - Texture S coordinate 

Texture Address 

0x0073 

Undefined 



31 


24 


16 




8 







■ i i i i i ■ i i i 




I I I I 


i ■ i i i i i 






Integer 




Fraction 



















Sign 



Reserved 

Used to set the Y dominant derivative for the S coordinate when texture 
mapping. Format is 2s-complement 12.18 fixed-point. 



7-42 



dTdx mnemonic 



dTdx 



Name: X Derivative - Texture T coordinate 

Unit: Texture Address 

Tag: 0x0075 

Reset Value: Undefined 

Read/write 



31 


24 


16 




8 







i i i i i i ■ i i i 


i i i ■ i 


i i i i i 


i ■ i i i i i 


i 




Integer 




Fraction 



















Sign 



Reserved 



Used to set the X derivative for the T coordinate when texture mapping. Format 
is 2s-complement 12.18 fixed-point. 



Graphics Register Reference 7-43 



mnemonic dTdyDom 



dTdyDom 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Y Derivative Dominant - Texture T coordinate 

Texture Address 

0x0076 

Undefined 



31 


24 


16 




8 







■ i i i i i ■ i i i 




I I I I 


i ■ i i i i i 






Integer 




Fraction 



















Sign 



Reserved 

Used to set the Y dominant derivative for the T coordinate when texture 
mapping. Format is 2s-complement 12.18 fixed-point. 



7-44 



dxDom mnemonic 



dXDom 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Delta X Dominant 
Rasterizer 
0x0001 
Undefined 



31 




24 


16 


8 





i i i 
Not used 




i i ■ i i i i i 
11 bit integer 




i i i i i i i ■ i i i i i i 
15 bit fraction 







Sign 



Not used 



Value added when moving from one scanline to the next for the dominant edge 
in trapezoid filling. The value is in 2s-complement 12.15 fixed-point format. 

Also holds the change in X when plotting lines. For Y major lines this will be 
some fraction (dx/dy), otherwise it is normally ± 1 .0, depending on the required 
scanning direction. 



Graphics Register Reference 7-45 



mnemonic dXSub 



dXSub 

Name: Delta X Subordinate 

Unit: Rasterizer 

Tag: 0x0003 

Reset Value: Undefined 
Read/write 



31 




24 


16 


8 





1 1 1 

Not used 




i ■ ■ i i i i i 
11 bit integer 




i i i i i i i ■ i i i i i i 
15 bit fraction 







Sign Not used 



Value added when moving from one scanline to the next for the subordinate 
edge in trapezoid filling. The value is in 2s-complement 12.15 fixed-point 
format. 



7-46 



dY mnemonic 



dY 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Delta Y 
Rasterizer 
0x0005 
Undefined 



31 



24 



16 



"1 1 i 1 

Not used 



10 bit integer 



14 bit fraction 



_i — i — i — ■ — i — i — 1_ 



-I 1 L 



Sign 



Not used 



Value added to Y to move from one scanline to the next. 

For X major lines this will be some fraction (dy/dx), otherwise it is normally 
±1 .0, depending on the required scanning direction. The value is in 2s-comple- 
ment 11.14 fixed-point format. 



For trapezoids the value will be ±1 .0 depending on the scanning direction. 



Graphics Register Reference 7-47 



mnemonic dZdxL 



dZdxL 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Depth Derivative X - Lower 

Stencil/Depth 

0x0139 

Undefined 



31 24 16 8 



1 I 1 I 1 1 1 1 1 1 

11 bit fraction 




Not used 


i i i i i i i i 











This register holds part of the depth derivative per unit in X used in rendering 
trapezoids. dZdxU holds the most significant bits, and dZdxL the least 
significant bits. The combined value is in 2s-complement 17.11 fixed-point 
format. 



7-48 



dzdxu mnemonic 



dZdxU 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Depth Derivative X - Upper 

Stencil/Depth 

0x0138 

Undefined 



31 


24 


16 


8 





i i i 


i i i i i i i i i i i 




i i i i i i i i i i i i 


i i i 




Not Used 




16 bit integer 




i i i_ 


i i i — i — i — i — i i — i — i — i — 




— i — i — i — i — i — i — i — ■ — i — i — i — i — 





I 



Sign 

This register holds part of the depth derivative per unit in X used in rendering 
trapezoids. dZdxU holds the most significant bits, and dZdxL the least signifi- 
cant bits. The value is in 2s-complement 1 7.1 1 fixed-point format. 



Graphics Register Reference 7-49 



mnemonic dZdyDomL 



dZdyDomL 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Depth Derivative Y Dominant - Lower 

Stencil/Depth 

0x01 3B 

Undefined 



31 24 16 8 



1 1 1 1 1 1 1 I 1 1 

11 bit fraction 




i i i i i 
Not used 


i i i i i i i i 











This register holds part of the depth derivative per unit in Y used for the 
dominant edge of a trapezoid, or along a line. dZdyDomU holds the most 
significant bits, and dZdyDomL the least significant bits. The value is in 
2s-complement 17.11 fixed-point format. 



7-50 



dZdyDomU mnemonic 



dZdyDomll 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Depth Derivative Y Dominant - Upper 

Stencil/Depth 

0x01 3A 

Undefined 



31 


24 


16 


8 





i i i 


i i i i i i i i i i i 




i i i i i i i i i i i i 


i i i 




Not Used 




16 bit integer 




i i i_ 


i i i — i — i — i — i i — i — i — i — 




— i — i — i — i — i — i — i — ■ — i — i — i — i — 





I 



Sign 



This register holds part of the depth derivative per unit in Y used for the 
dominant edge of a trapezoid, or along a line. dZdyDomU holds the most 
significant bits, and dZdyDomL the least significant bits. The value is in 
2s-complement 17.11 fixed-point format. 



Graphics Register Reference 7-51 



mnemonic FBBiockCoior 



FBBIockColor 



Name: 



Framebuffer Block Color 



Unit: 



Framebuffer R/W 



Tag: 



0x0159 



Reset Value 



Undefined 



Read/write 



31 



24 



16 



8 







J I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I L 



Contains the color (and optionally alpha value) to be written to the framebuffer 
during block writes. Note the format is the raw data format of the framebuffer. 

If the framebuffer is used in 8-bit packed mode, then data should be repeated 
in all four bytes of the register. 

If the framebuffer is in 1 6-bit packed mode then the data must be repeated in 
both halves of the register. 

Note that this register should not be updated immediately after a Render 
command which performs a block write. 



32 bit value 



7-52 



FBCoior mnemonic 



FBColor 



Name: 



Framebuffer Color Upload 



Unit: 



Framebuffer R/W 



Tag: 



0x0153 



Reset Value: 



Undefined 



Read/write 



31 



24 



16 



8 







t 1 i — i — i — i i — ■ — i 1 — i 1 — i — i — i ■ — i 1 — i 1 1 — i 1 — ■ — i 1 1 — i — i 1 r 



J I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I L 



Internal register used in image upload. Note that this register should not be 
written to. It is documented here to give the format and tag value of the data 
returned through the Host Out FIFO. 

The format is dependant on the raw framebuffer organization and any 
reformatting which takes place due to the format specified in the DitherMode 
register. 



32 bit framebuffer data 



Graphics Register Reference 



7-53 



mnemonic FBData 



FBData 

Name: Framebuffer Data 

Unit: Framebuffer R/W 

Tag: 0x0154 

Reset Value: Undefined 
Write 

31 24 16 8 

32 bit value 



Supplies the data for image download, where subsequent formatting is 
required. The formatting can be achieved by means of the AlphaBlendMode 
register to convert to the internal TVP4010 format, and then via the 
DitherMode register to convert to the required format. 



7-54 



FBHardwareWhteMask mnemonic 



FBHardwareWriteMask 



Name: 



Hardware Writemask 



Unit: 



Framebuffer R/W 



Tag: 



0x0158 



Reset Value 



Undefined 



Read/write 



31 



24 



16 



8 







t 1 i — i — i — i i — ■ — i 1 — i 1 — i — i — i ■ — i 1 — i 1 1 — i 1 — ■ — i 1 1 — i — i 1 r 



j i i i i i i i i i i i i i i ■ I ! i i i i i ■ I I I I I I L 



Contains the hardware writemask for the framebuffer. If a bit is set to one then 
the corresponding bit in the framebuffer is enabled for writing, otherwise it is 
disabled. Only applicable to configurations where the framebuffer supports a 
hardware writemask. In cases where it is not supported, this register should 
not be written to. 

If hardware writemasks are used then all the bits in the FBSoftwareWriteMask 
register must be set to 1 , so that software writemasking is disabled. 

If the framebuffer is used in 8-bit packed mode, then an 8bit hardware write- 
mask must be repeated in all four bytes of the FBHardwareWriteMask register. 

If the framebuffer is in 1 6-bit packed mode then the 1 6 bit hardware writemask 
must be repeated in both halves of the FBHardwareWriteMask register. 



32 bit mask 



Graphics Register Reference 



7-55 



mnemonic FBPixeiOffset 



FBPixelOffset 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Framebuffer Pixel Offset 
Framebuffer R/W 
0x0152 
Undefined 



31 



24 



16 8 

I ■ 1 1 1 1 1 1 1 ¥ 

24 bit 2s-complement integer 



l i I i i i 

Not used 
_i i i i i i_ 



Offset between buffers when operating on multiple buffers in the framebuffer 
at the same time (e.g. left/right/top/bottom in some OpenGL implementations). 
The offset can be treated as signed or unsigned. 



7-56 



FBReadMode mnemonic 



FBReadMode 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Framebuffer Read Mode 
Framebuffer R/W 
0x0150 
Undefined 



31 



24 



16 



Reserved 



_i — i — 1_ 



Reserved 



PP2 



PP1 



PPO 



ata type 



Patch Mode 

Reserved Packed data 

Relative offset Patch Enable \ Window origin 




Partial product selection 



ReadSource enable 



ReadDestination enable 



Controls reading from framebuffer memory. 

Incorrect data can be read if reads are enabled but the same data had just 
been written with reads disabled. To avoid this problem, a WaitForCompletion 
command should be sent after enabling reads, but prior to the next primitive. 



BitO-2 



Bit3-5 



Bit6-8 



Bit9 



Bit10 



Partial Product - See Appendix C for a table of 
values. 

Partial Product 1 - See Appendix C for a table of 
values. 

Partial Product 2 - See Appendix C for a table of 
values. 

Read Source Enable: 

= no read 

1 = do read 

Read Destination Enable: 

= no read 

1 = do read 



Graphics Register Reference 7-57 



mnemonic FBReadMode 



Bin 5 



Data Type: 



Bit1 6 



Bit1 8 



Window Origin: 



Patch Enable: 



Bit1 9 



Packed Data: 



Bit20-22 



RelativeOffset 



Bit25-26 



Patch Mode 



= FBDefault - for data that may 

be written back to the frame 
buffer 

1 = FBColor - for image upload 

= Top left 

1 = Bottom left 



= Disable 

1 = Enable patched addressing 

for framebuffer accesses 



= Disable. Force TVP4010 to 

read one pixel at a time. 

1 = Enable. Allow TVP4010 to 

read multiple packed pixels 
when possible. 



3 bit 2's compliment value which 
specifies the number of pixels 
that the source data has to be 
adjusted to align with the 
destination data. 



= Patch (suitable for depth 

buffer patching) 

1 = Subpatch (suitable for texture 

buffer patching) 

2 = SubpatchPack (suitable for 

packed texture patching) 



7-58 



FBReadPixel mnemonic 



FBReadPixel 

Name: Framebuffer Read Pixel 

Unit: Framebuffer R/W 

Tag: 0x01 5 A 

Reset Value: Undefined 
Read/write 

31 24 16 8 

1 1 i 1 i i i 1 1 1 1 1 1 1 1 ■ 1 1 1 1 1 1 1 ■ 1 1 1 1 1 1 1 

Reserved 



Pixel Size 

Sets the pixel size for reading from the framebuffer. 
Bit0-1 Pixel Size: 

= 8 bits 

1 = 16 bits 

2 = 32 bits 



Graphics Register Reference 7-59 



mnemonic FBSoftwareWriteMask 



FBSoftwareWriteMask 



Name: 



Software Writemask 



Unit: 



Logic Op 



Tag: 



0x0104 



Reset Value 



Undefined 



Read/write 



31 



24 



16 



8 







t 1 i — i — i i — i — ■ — i 1 — i 1 — i 1 — i ■ — i 1 — i 1 — i — i 1 — ■ — i 1 1 — i 1 1 — r 



Contains the software writemask for the framebuffer. If a bit is set to one then 
the corresponding bit in the framebuffer is enabled for writing, otherwise it is 
disabled. In addition, whenever the writemask is other than all 1 s, framebuffer 
reads must be enabled by setting the ReadSourceEnable bit in the FBRead- 
Mode register. 

If hardware writemasks are used then all the bits in the software writemask 
must be set to 1 , so that software writemasking is disabled. 



32 bit mask 



7-60 



FBSourceData mnemonic 



FBSourceData 

Name: Framebuffer Source Data 

Unit: Framebuffer R/W 

Tag: 0x0155 
Reset Value: Undefined 
Write 

31 24 16 8 

1 1 i 1 i i i 1 1 1 1 1 1 1 1 ■ 1 1 1 1 1 1 1 ■ 1 1 1 1 1 1 1 

32-bit value 



Supplies the data for image download with logic ops, where the data is treated 
as the source rather than the destination parameter. 



Graphics Register Reference 7-61 



mnemonic FBSourceOffset 



FBSourceOffset 



Name: Framebuffer Source Offset 

Unit: Framebuffer R/W 

Tag: 0x0151 

Reset Value: Undefined 

Read/write 



31 24 16 

t 1 — i 1 — i 1 — i ■ — i 1 — i r 



l i I i i i 

Not used 
_i i i i i i_ 



24 bit 2's complement integer 



Sets the offset from destination to source for a copy operation in the frame 
buffer i.e.; 

source offset = destination address - source address 



7-62 



FBWindowBase mnemonic 



FBWindowBase 



Name: Framebuffer Window Base 

Unit: Framebuffer R/W 

Tag: 0x0156 

Reset Value: Undefined 

Read/write 



31 24 16 

I I i I i i r 



1 — I 1 — I 1 1 — I 1- 

24 bit unsigned integer 



Reserved 
_i i i i 



Contains the current base address of the window in the framebuffer. 



Graphics Register Reference 7-63 



mnemonic FBWriteData 



FBWriteData 



Name: 



Framebuffer Write Data 



Unit: 



Logic Op 



Tag: 



0x106 



Reset Value 



Undefined 



Read/write 



31 



24 



16 



8 







J I I 1 1 1 1 ■ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 L 



Contains the color value to be written to the framebuffer when the 
UseConstantFBWriteData bit of the LogicalOpMode register is set to one. 
Note that the following conditions must be met for this mode of rendering to 
be used: 

□ Flat shaded aliased primitive 

□ No dithering required 

□ No logical operation involving a destination factor 

□ No stencil or depth test 

□ No texture, fog or alpha blending 

□ No software writemasking 

The data is in the raw format of the framebuffer. If the pixel size is 8 bits then 
the data should be repeated in all four bytes. If the pixel size is 1 6 bits the data 
should be repeated in both halves of the word. 

Hardware writemasks can be used if available. 



32 bit data 



7-64 



FBWriteMode mnemonic 



FBWriteMode 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Framebuffer Write Mode 
Framebuffer R/W 
0x0157 
Undefined 



31 



24 



16 



~1 1 i 1 i i i 1 1 T 



-J 1 1 1 1 1 1 1 1 L- 



1 1 1 1 ■ 1 T 

Reserved 



_i — i — i — ■ — i — i — 1_ 



UpLoadData 

Reserved 
Write enable 



Controls writing to the framebuffer. 

BitO Write Enable: 



Bit3 



UpLoadData: 



= Disable 

1 = Enable 



= No upload 

2 = Upload color to host 



Graphics Register Reference 7-65 



mnemonic FilterMode 



FilterMode 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Filter Mode 
Host Out 
0x0180 
Undefined 



31 



24 



16 



i i i I I 

Reserved 



Individual bits defined below 



Controls culling of information from the output FIFO. If both tag and data are 
specified, then the tag is always the first word in the FIFO. 



BitO-3 
Bit4 



Bit5 



Bit6 



Diagnostic use only - set to zero. 

Depth Tag Filter: Used in-depth buffer image 
upload. 

= Cull Depth Tags from being 

passed to output FIFO 

1 = Pass Depth Tags to output 

FIFO 

Depth Data Filter: Used in-depth buffer image 
upload 

= Cull Depth data values from 

being passed to output FIFO 

1 = Pass Depth data values to 

output FIFO 

Stencil Tag Filter: Used in Stencil buffer image 
upload 

= Cull Stencil Tags from being 

passed to output FIFO 

1 = Pass Stencil Tags to output 

FIFO 



7-66 



FiiterMode mnemonic 



Bit7 Stencil Data Filter: Used in Stencil buffer image 

upload 

= Cull Stencil data values 

from being passed out 
put FIFO 

1 = Pass Stencil data values 

to output FIFO 

Bit8 Color Tag Filter: Used in Framebuffer image 

upload 

= Cull Color Tags from being 

passed to output FIFO 

1 = Pass Color Tags to output 

FIFO 

Bit9 Color Data Filter: Used in Framebuffer image 

upload 

= Cull Color data values from 

being passed to output 
FIFO 

1 = Pass Color data values to 

output FIFO 

Bit10 Synchronization Tag Filter: 

= Cull Synchronization Tags 

from being passed to output 
FIFO 

1 = Pass Synchronization Tags to 

output FIFO 

Bit11 Synchronization Data Filter: 

= Cull Synchronization data 

values from being passed to 
output FIFO 

1 = Pass Synchronization data 

values to output FIFO 

Bit12 Statistics Tag Filter: Used in Picking and Extent 

read back 

= Cull Statistics Tags from 

being passed to 
output FIFO 

1 = Pass Statistics Tags to 

output FIFO 



Graphics Register Reference 7-67 



mnemonic FilterMode 



Bit13 Statistics Data Filter: Used in Picking and Extent 

read back 

= Cull Statistics data values 

from being passed to output 
FIFO 

1 = Pass Statistics data values to 

output FIFO 

Bit1 4-1 5 Diagnostic use only - set to zero. 



7-68 



FogCoior mnemonic 



FogColor 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Fog Color 
Texture/Fog/Blend 
0x00D3 
Undefined 



31 24 16 8 



I I 1 I 
Alpha 


■ i 
Not used 


Blue 


Not used 


i i i i 
Green 


Not used 


i i i i 
Red 


i i 
Not used 



Provides the color to be blended with the fragment color when fogging is 
enabled. 



Graphics Register Reference 7-69 



mnemonic FogMode 



FogMode 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Fog Mode 
Texture/Fog/Blend 
0x00D2 
Undefined 



31 



24 



16 



~i — i — i — i ■ — i r 

Reserved 



i I i i i 



_l I I 1_ 



_l 1 1 1_ 



—I 1 1 1 1 1 1— 



I 

FogTest 
Reserved 
Fog Enable 

Controls operation of the Fog unit. 

Enabling FogTest causes fragments with negative fog values to be rejected. 

Note that the FogEnable bit in the Render command must be set for fogging 
to be applied to a primitive. 



BitO 



Bit2 



Enable Fog: 



Fog Test: 



= Disable 

1 = Enable 



= Disable 

1 = Enable 



7-70 



FStart mnemonic 



FStart 

Name: Initial Fog Value 

Unit: Texture/Fog/Blend 

Tag: 0x00 D4 

Reset Value: Undefined 
Read/write 



31 24 16 8 



1 1 1 1 1 1 1 






i i i i i ■ i i i i i 


i i ■ i 


i i i i i 


Not used 






Fraction 




Not used 



Sign Integer 



Fog coefficient start value. Note the interpolation coefficient is used to blend the fragment 
color with the color in the FogColor register. The value is in 2s-complement 2.1 6 fixed-point 
format. 



Graphics Register Reference 7-71 



mnemonic GStart 



GStart 

Name: Initial Green Color 

Unit: Color DDA 

Tag: 0x00 F3 

Reset Value: Undefined 
Read/write 



31 




24 




16 




8 







■ i i i i i i 
Not used 




i i i i 
Integer 


i ■ i 

I 1 I L 


1 1 1 1 1 

Fraction 


i ■ 

i i 


i i i i i i 
Not used 



L Sign 



This register is used to set the initial value for the Green value for a vertex when in Gouraud 
shading mode. The value is 2s-complement 6.11 fixed-point format. 



7-72 



KdStart mnemonic 



KdStart 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Initial Kd Value 
Texture/Fog/Blend 
OxOODC 
Undefined 



31 



24 



16 



T 1 1 1 - 

Fraction 



-i — i — i 1 r 

Not used 



■ i i 



Not used 



I I 
Sign Integer 



Start value for diffuse light parameter when texture mapping using ramp application mode. 
The value is in 2s-complement 2.16 fixed-point format. 



Graphics Register Reference 7-73 



mnemonic KsStart 



KsStart 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Initial Ks Value 
Texture/Fog/Blend 
0x00D9 
Undefined 



31 



24 



16 



- 1 1 — i — r 

Fraction 



— i — i — i — r 
Not used 



Not used 



I I 

Sign Integer 



Start value for specular light parameter when texture mapping using ramp application 
mode. The value is in 2s-complement 2.16 fixed-point format. 



7-74 



LBData mnemonic 



LBData 



Name: 

Unit: 

Tag: 

Reset Value: 
Write 



Localbuffer Data Download 
Localbuffer R/W 
0x0113 
Undefined 



31 


24 


16 


8 





i i i i i i i i i i i i i i 
Reserved 




15 or 16-bit Depth value 



1 bit Stencil value 



Used to download depth and/or stencil data to localbuffer memory. Data should be 
supplied in the raw localbuffer format. 



Graphics Register Reference 7-75 



mnemonic LBDepth 



LBDepth 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Localbuffer Depth Upload 
Localbuffer R/W 
0x0116 
Undefined 



31 



24 



16 



_l I I I I L. 



_j 1 1 1_ 



16-bit Depth value 



Used to upload depth data from localbuffer memory. This register should not be written to. 
It is documented here to give the tag value and format of the data when read from the Host 
Out FIFO. If the depth buffer is less than 16 bits, the depth value is right justified and zero 
extended. 



7-76 



LBReadFormat mnemonic 



LBReadFormat 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Localbuffer Read Format 
Local buffer R/W 
0x0111 
Undefined 



31 



24 



16 



■ i i 



Reserved 



_l I 1 u 



_1 1 1 1 1 1 1_ 



Stencil Width 

Depth Width 

Specifies the format used when reading from localbuffer memory. The effect 
of creating a format with overlapping fields is undefined. There is no need to 
synchronize the TVP401 before changing this register. 



Bit0-1 



Depth Width: 



Bit2-3 



Stencil Width: 



= 16 

1 = reserved 

2 = reserved 

3 = 15 



= 

1 = reserved 

2 = reserved 

3 = 1 



Graphics Register Reference 7-77 



mnemonic LBReadMode 



LBReadMode 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Localbuffer Read Mode 
Localbuffer R/W 
0x0110 
Undefined 



31 24 


16 




8 







■ i i i i i i ■ i i i 

Reserved 








Reserved 






■ i 

PP2 


PP1 


PPO 



Patch Enable Data Type 
Window origin 



Partial product selection 



ReadSource enable 
ReadDestination enable 



Controls reading from localbuffer memory. 

Incorrect data can be read if reads are enabled but the same data had just 
been written with reads disabled. To avoid this problem, a WaitForCompletion 
command should be sent after enabling reads, but prior to the next primitive. 



Bit0-2 



Bit3-5 



Bit6-8 



Bit9 



Bit10 



Partial Product - See Appendix C for a table of 
values 

Partial Product 1 - See Appendix C for a table of 
values 

Partial Product 2 - See Appendix C for a table of 
values 

Read Source Enable: 

= no read 

1 = do read 

Read Destination Enable: 

= no read 

1 = do read 



7-78 



LBReadMode mnemonic 



Bit1 6—1 7 Data Type: 

= Default 

1 = Localbuffer Stencil 

2 = Localbuffer Depth 

Bit18 Window Origin: 

= Top left 

1 = Bottom left 

Bit19 Patch Enable 

= Disable 

1 = Enable patched addressing of 

the localbuffer 



Chapter Title— Attribute Reference 



7-79 



mnemonic LBSourceOffset 



LBSourceOffset 



Name: Localbuffer Source Offset 

Unit: Localbuffer R/W 

Tag: 0x0112 

Reset Value: Undefined 
Read/write 



31 24 16 

t 1 — i 1 — i 1 — i ■ — i 1 — i r 



l i I i i i 

Not used 
_i i i i i i_ 



24 bit signed integer 



Sets the offset from destination to source for a copy operation in the 
localbuffer, i.e.: 

source offset = destination address - source address 



7-80 



LBStencil mnemonic 



LBStencil 



Name: 

Unit: 

Tag: 



Localbuffer Stencil Upload 
Local buffer R/W 
0x0115 



Reset Value: Undefined 
Read/Output 



31 



24 

"i i i 1~ 



_l I 1 u 



16 

i — r 





_i — i 1 1_ 



1 bit Stencil value 



Used to upload stencil data from localbuffer memory. This register should not 
be written to. It is documented here to give the tag value and format of the data 
when read from the Host Out FIFO. 



Graphics Register Reference 7-81 



mnemonic LBWindowBase 



LBWindowBase 



Name: Localbuffer Window Base 

Unit: Localbuffer R/W 

Tag: 0x0117 

Reset Value: Undefined 
Read/write 



31 24 16 

t 1 — i 1 — i 1 — i ■ — i 1 — i r 



l i I i i i 

Not used 
_i i i i i i_ 



24 bit unsigned integer 



Contains the current base address of the window in the localbuffer. 



7-82 



LBWriteFormat mnemonic 



LBWriteFormat 

Name: Localbuffer Write Format 

Unit: Localbuffer R/W 

Tag: 0x0119 
Reset Value: Undefined 
Read/write 

31 24 16 8 

1 1 i 1 i i i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Reserved 



Stencil Width 

Depth Width 

Specifies the format used when writing to localbuffer memory. The effect of 
setting a configuration with overlapping fields is undefined. 

Bit0-1 Depth Width: 

= 16 

1 = reserved 

2 = reserved 

3 = 15 

Bit2-3 Stencil Width: 

= 

1 = reserved 

2 = reserved 

3 = 1 



Graphics Register Reference 7-83 



mnemonic LBWhteMode 



LBWriteMode 



Name: Localbuffer Write Mode 

Unit: Localbuffer R/W 

Tag: 0x0118 

Reset Value: Undefined 
Read/write 

31 24 16 



"i 1 i i i 1 1 1 1 1 1 1 1 ■ 1 T 

Reserved 



_l I I I I I ■ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 u 



Write Enable 



Controls writing to the localbuffer. 

BitO Write Enable: 



= Disable 

1 = Enable 



7-84 



LogicaiOpMode mnemonic 



LogicalOpMode 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Logic Op Mode 
Logic Op 
0x0105 
Undefined 



31 



24 



16 



■ i i 



Reserved 



-I I 1 u 



LogicOp 
— i — i — i— 



UseConstantFBWriteData 



LogicalOp enable 



Controls Logical Operations on the framebuffer. 



The UseConstantFBWriteData bit when set to one, causes the color value in 
the FBWriteData register to be written to the framebuffer, rather than the 
fragment color. This can achieve higher bandwidth into the framebuffer for flat 
shaded primitives, but may only be used when LogicalOps are disabled (bit 
cleared to 0). 





BitO 




Logic Op Enable: 

= Disable 

1 = Enable 






Bit1-4 




Logic Op: 






Mode 


Name 


Operation 


Mode 


Name 


Operation 





CLEAR 





8 


NOR 


~(S | D) 


1 


AND 


S&D 


9 


EQUIV 


~(S A D) 


2 


AND REVERSE 


S&-D 


10 


INVERT 


~D 


3 


COPY 


S 


11 


OR REVERSE 


S | ~D 


4 


AND INVERTED 


-S&D 


12 


COPY INVERT 


~S 


5 


NO-OP 


D 


13 


OR INVERT 


~S | D 


6 


XOR 


S A D 


14 


NAND 


~(S & D) 


7 


OR 


S | D 


15 


SET 


1 



Where: 



S = Source (fragment) color, D = Destination (framebuffer) color. 



Graphics Register Reference 



7-85 



mnemonic LogicalOpMode 



Bit5 UseConstantFBWriteData: 

= Variable 

1 = Constant 



7-86 



MaxHitRegion mnemonic 



MaxHitRegion 



Name: Max Hit Region 

Unit: Host Out 

Tag: 0x0186 

Reset Value: Undefined 
Write 



31 



The format of the data input is: 

24 16 



~ I I i I i i T" 



_l I I 1_ 



Reserved 



_i — i — i — i — 1_ 



_i — i — i — i — i — i — i — i — i — 1_ 



31 



The format of the data output is: 

24 16 



~l 1 i 1 i i i 1 1 1 1 1 T 

16-bit 2s-complement integer Max Y 
_i i i i i — i — i — i — i — i — i — i — i 



i — i 1 1 — i 1 — ■ — i 1 1 — i — i 1 r 

16-bit 2s-complement integer Max X 

! 1 1 1 1 1 ■ 1 1 1 1 1 1 1— 



This command causes the maximum coordinates of the hit region to be passed 
to the Host Out FIFO, unless culled by the statistics bits in the FilterMode 
register. 



The corresponding tag value output is: 0x1 86. 



Graphics Register Reference 7-87 



mnemonic Max Reg ion 



MaxRegion 



Name: 



Max Region 



Unit: 



Host Out 



Tag: 



0x0183 



Reset Value: 



Undefined 



Read/write 



31 



24 



16 



8 







16-bit 2s-complement integer Max Y 



16-bit 2s-complement integer Max X 



This register has two uses: 

1 . During Picking it contains the maximum (X,Y) value for the pick region. 

2. During Extent collection, it is set to the initial minimum (X,Y) extent, and 
thereafter will be updated whenever an eligible fragment is generated which 
has a higher X or Y value, with that higher value. Note that eligible fragments 
can be either those that are written as pixels OR those that were rasterized, 
but were culled from being drawn, as controlled by the StatisticMode register. 

This register is unusual in that its contents are updated by the TVP4010 during 
rendering, and so if read back, will not necessarily be the same as when 
originally stored. 



7-88 



MinHitRegion mnemonic 



MinHitRegion 



Name: Min Hit Region 

Unit: Host Out 

Tag: 0x0185 

Reset Value: Undefined 
Write 



31 



The format of the data input is: 

24 16 



~ I I i I i i T" 



_l I I 1_ 



Reserved 



_i — i — i — i — 1_ 



_i — i — i — i — i — i — i — i — i — 1_ 



31 



The format of the data output is: 

24 16 



~l 1 i 1 i i i 1 1 1 1 1 1 

16-bit 2s-complement integer Min Y 
_i i i i i — i — i — i — i — i — i — i — i 



— i 1 1 — i 1 — ■ — i 1 1 — i — i 1 r 

16-bit 2s-complement integer Min X 

1 1 1 1 1 ■ 1 1 1 1 1 1 u 



This command causes the minimum coordinates of the hit region to be passed 
to the Host Out FIFO, unless culled by the statistics bits in the FilterMode 
register. 



The corresponding tag value output is: 0x1 85. 



Graphics Register Reference 7-89 



mnemonic MinRegion 



MinRegion 



Name: 



Min Region 



Unit: 



Host Out 



Tag: 



0x0182 



Reset Value: 



Undefined 



Read/write 



31 



24 



16 



8 







16-bit 2s-complement integer Min Y 



16-bit 2s-complement integer Min X 



This register has two uses: 

1 . During Picking it contains the minimum (X,Y) value for the pick region. 

2. During Extent collection, it is set to the initial maximum (X,Y) extent, and 
thereafter will be updated whenever an eligible fragment is generated which 
has a lower X or Y value, with that lower value. Note eligible fragments can be 
either those that are written as pixels OR those that were rasterized, but were 
culled from being drawn, as controlled by the StatisticMode register. 

This register is unusual in that its contents are updated by the TVP4010 during 
rendering, and so if read back, will not necessarily be the same as when 
originally stored. 



7-90 



PackedDataLimits mnemonic 



PackedDataLimits 

Name: Packed copy limits 

Units: Framebuffer R/W 

Tag: 0x002 A 

Reset Value: Undefined 
Read/write 



31 


24 


16 




8 





i i i 
Not used 


i i i ■ i i i i i i i 
12 bit integer XStart 


Not used 


12 bit integer XEnd 



Sets the start and end limits in X for packed copies. Any pixels lying outside 
the specified range are not plotted. This test is only active when the 
PackedData bit in FBReadMode is enabled. 

Bit0-1 1 XEnd: 1 2-bit 2s-complement value 



Bit1 6-27 XStart: 1 2-bit 2s-complement value 



Graphics Register Reference 7-91 



mnemonic PickResuit 



PickResult 



Name: 

Unit: 

Tag: 

Reset Value: 
Write 



31 



Pick Result 
Host Out 
0x0187 
Undefined 

The format of the data input is: 

24 16 



~ i I i i i I 



_1 i i i_ 



Reserved 



—I 1 1 ■ 1 1 1 1 u 



31 



The format of the data output is: 

24 16 



~ i I i i i I 



_1 i i i i i 1 ■ 1 1 i_ 



Reserved 



_i — i — i — i — i — i — 1_ 



PickFlag 

This command causes the current status of the picking result to be passed to 
the Host Out FIFO, unless culled by the statistics bits in the FilterMode register. 

The corresponding tag value output is: 0x187 

BitO PickFlag: 



Bit1 



BusyFlag: 



= Miss 

1 = Hit has occurred 



= Idle 

1 = Busy - used to validate the 
Pick Flag bit if this register is 
polled directly 



7-92 



QStart mnemonic 



QStart 



Name: 

Unit: 

Tag: 

Reset Value: 
Write 



31 



Initial texture Q value 
Texture Address 
0x0077 
Undefined 

The format of the data input is: 

24 16 



-i — i — i — r 
Fraction 



Reserved 



L 



Integer 



Sign 



Used to set the initial value for the Q coordinate when texture mapping. Format 
is 2s-complement 2.27 fixed-point. 



Graphics Register Reference 7-93 



mnemonic RasterizerMode 



RasterizerMode 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Rasterizer Mode 
Rasterizer 
0x0014 
Undefined 



31 


24 


16 




8 







i i i i i ■ i i i 
Reserved 








i — i — i 


i i i i 
BitMaskOffset 




i 









































BitMaskRelative 

LimitsEnable 

Reserved 
HostDataByteSwapMode 





BitMaskPacking / BiasCoordinates 

BitMaskByteSwapMode / FractionAdjust 

ForceBackgroundColor InvertBitMask 

MirrorBitMask 



Defines the long term mode of operation of the Rasterizer. 



BitO 



MirrorBitMask 



Bit1 



Bit2-3 



InvertBitMask 



= use bit mask from least to 
most significant bit 

1 = use bit mask from most to 
least significant bit 



= test against bitmask 

1 = test against inverted bitmask 



FractionAdjust These bits are for the Continue 
NewLine command and specify how the fraction 
bits in the Y and XDom DDAs are adjusted. 

= No adjustment is done, 

1 = Set the fraction bits to zero, 

2 = Set the fraction bits to half. 

3 = Set the fraction to nearly half, 

i.e. 0x7FFF 



7-94 



FtasterizerMode mnemonic 



Bit4- 5 BiasCoordinates These bits control how much is 

added onto the StartXDom, StartXSub and StartY 
values when they are loaded into the DDA units. 
The original registers are not effected. 

= Zero is added, 

1 = Half is added 

2 = Nearly half is added, i.e. 

0x7FFF 

Bit6 ForceBackgroundColor Controls operation of bit 

mask test. If disabled any fragment failing the test 
is discarded. If enabled any fragment failing the 
test is drawn (other tests allowing) but the color is 
taken from the TexelO register. Used to support 
foreground/background colors. 

= disabled 

1 = enabled 

Bit7— 8 BitMaskByteSwapMode. Controls byte swapping 

forbitmask. Input ABCD. 

= ABCD 

1 = BADC 

2 = CDAB 

3 = DCBA 

Bit9 BitMaskPacking. 

= bitmask packed 

1 = new data every scanline 

Bit1 0-1 4 BitMaskOffset. Position of first bit to test in bit 

mask. 

Bit1 5-1 6 HostDataByteSwapMode. Controls byte 

swapping for host data. Input ABCD. 

= ABCD 

1 = BADC 

2 = CDAB 

3 = DCBA 

Bit1 8 LimitsEnable. Enable X and Y limits checking. 

= disabled 

1 = enabled 



Chapter Title— Attribute Reference 



7-95 



mnemonic RasterizerMode 



Bit19 BitMaskRelative 

= bitmask indexed by counter 

1 = bitmask indexed by X position 



7-96 



Render mnemonic 



Render 



Name: 

Unit: 

Tag: 

Reset Value: 
Write 



Render 
Rasterizer 
0x0007 
Undefined 



31 



24 



16 



i I i i i r 

Reserved 



Reserved 



ReuseBitMask 

SubPixelCorrectionEnable 

Reserved 




Fog enable 
Texture enable 
SyncOnHostData 
SyncOnBitMask 



Primitive type 

Reserved 
Fast Fill enable 



Reserved 
AreaStippleEnable 



Command to start the rendering process. 

The data field defines the short term modes required by this primitive. 



BitO 



AreaStippleEnable. Note that area stipple in the 
Stipple Unit must be enabled as well for stippling 
to occur. 

= Disable 

1 = Enable 



Bit3 



Bit6-7 



FastFillEnable 



= Disable block filling 

1 = Enable block filling 



Bit11 



PrimitiveType These bits indicate the type of 
TVP4010 primitive to be drawn. The primitives 
supported and the corresponding codes are: 

= lines, 

1 = trapezoids, 

2 = points. 

SyncOnBitMask Enable bitmask test. Wait for 
new bitmask when current one expires unless 



Graphics Register Reference 7-97 



mnemonic Render 



SyncOnHostData or ReuseBitMask enabled. 

= Disable 

1 = Enable 

Bit1 2 SyncOnHostData. When this bit is set, a fragment 

is produced only when one of the following regis 
ters has been written by the host: Depth, FBData, 
FBSourceData, Stencil, Color or TexelO. Also Bit 
MaskPattern if SyncOnBitMask is set. 

= Disable 

1 = Enable 

Bit1 3 TextureEnable. Note that the Texture Units must 

be suitably enabled as well for any texturing to 
occur. 

= Disable 

1 = Enable 

Bit14 FogEnable. Note that the Fog Unit must be 

suitably enabled as well for any fogging to occur. 

= Disable 

1 = Enable 

Bit1 6 SubPixelCorrectionEnable. Enables the sub pixel 

correction of color, depth, fog and texture values 
at the start of a scanline span. 

= Disable 

1 = Enable 

Bit17 ReuseBitMask. Allows the bitmask to be reused 

when it has expired; if enabled the Rasterizer will 
not wait for a new mask when the current one has 
been used. 

= Disable 

1 = Enable 



7-98 



ResetPickResuit mnemonic 



ResetPickResult 

Name: ResetPickResult 
Units: Host Out 

Tag: 0x0184 
Reset Value: Undefined 
Write 

31 24 16 8 

1 1 i 1 i i i 1 1 1 1 1 1 1 1 ■ 1 1 1 1 1 1 1 ■ 1 1 1 1 1 1 1 

Reserved 

i i i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — ■ — i — i — i — i — i — i — i 

This command causes the current value of the picking result to be reset to 
zero. The data field is not used. 



Graphics Register Reference 7-99 



mnemonic RStart 



RStart 

Name: Initial Red Color 

Unit: Color DDA 

Tag: 0x00 FO 

Reset Value: Undefined 
Read/write 



31 




24 




16 




8 







■ i i i i i i 
Not used 




i i i i 
Integer 


i ■ i 

I 1 I L 


1 1 1 1 1 

Fraction 


i ■ 

i i 


i i i i i i 
Not used 



L Sign 



This register is used to set the initial value for the Red value for a vertex when 
in Gouraud shading mode. The value is 2s-complement 6.11 fixed-point 
format. 



7-100 



ScissorMaxXY mnemonic 



ScissorMaxXY 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Scissor Rectangle - Maximum XY 

Scissor/Stipple 

0x0032 

Undefined 



31 


24 16 




8 





i i i 
Not used 


i i i ■ i i i i i i i i 
12-bit 2s-complement Max Y 


i i i 
Not used 


i i i ■ i i i i i i i 
12-bit 2s-complement Max X 



Specifies the user scissor rectangle corner farthest from the screen origin. 



Graphics Register Reference 7-101 



mnemonic ScissorMinXY 



ScissorMinXY 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Scissor Rectangle - Minimum XY 

Scissor/Stipple 

0x0031 

Undefined 



31 


24 16 




8 





■ i i 
Not used 


i i i ■ i i i i i i i 
12-bit 2s-complement Min Y 


Not used 


12-bit 2s-complement Min X 



Specifies the user scissor rectangle corner closest to the screen origin. 



7-102 



ScissorMode mnemonic 



ScissorMode 



Name: ScissorMode 

Unit: Scissor/Stipple 

Tag: 0x0030 

Reset Value: Undefined 
Read/write 



31 24 16 

1 1 i 1 i i i 1 1 1 1 1 1 1 1 1 



Reserved 



_ i — i — 1_ 



Screen scissor enable 

User scissor enable 



Controls enabling of the screen and user scissor tests. 

BitO User Scissor Enable: 

= Disable 

1 = Enable 

Bit1 Screen Scissor Enable: 

= Disable 

1 = Enable 



Graphics Register Reference 7-103 



mnemonic ScreenSize 



ScreenSize 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Screen Size 
Scissor/Stipple 
0x0033 
Undefined 



31 


24 16 




8 


■ i i i 
Not used 


i ■■ i i i i i i i 
11 bit unsigned integer Height 


i i i i 
Not used 


i i ■ i i i i i i i 
11 bit unsigned integer Width 



Screen dimensions for screen scissor clip. The screen boundaries are (0, 0) 
to (width - 1 , height - 1 ) inclusive. 



7-104 



sstart mnemonic 



SStart 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Initial texture S value 
Texture Address 
0x0071 
Undefined 



31 


24 


16 




8 







i i i i i i ■ i i i 


i i i ■ i 


i i i i i 


i ■ i i i i i 


i 




Integer 




Fraction 



















Sign 



Reserved 



Used to set the initial value for the S coordinate when texture mapping. Format 
is 2s-complement 12.18 fixed-point. 



Graphics Register Reference 7-105 



mnemonic StartXDom 



StartXDom 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Start X Value - Dominant Edge 

Rasterizer 

0x0000 

Undefined 



31 




24 


16 


8 





1 1 1 

Not used 




i ■ ■ i i i i i 
11 bit integer 




i i i i i i i ■ i i i i i i 
15 bit fraction 







Sign 



Not used 



Initial X value for the dominant edge in trapezoid filling, or initial X value in line 
drawing. The value is in 2s-complement 12.15 fixed-point format. 



7-106 



startxsub mnemonic 



StartXSub 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Start X Value - Subordinate Edge 

Rasterizer 

0x0002 

Undefined 



31 




24 


16 


8 





i i i 
Not used 




i i ■ i i i i i 
11 bit integer 




i i i i i i i ■ i i i i i i 
15 bit fraction 







Sign 



Not used 



Initial X value for the subordinate edge in trapezoid filling. The value is in 
2s-complement 12.15 fixed-point format. 



Graphics Register Reference 7-107 



mnemonic StartY 



StartY 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Start Y Value 
Rasterizer 
0x0004 
Undefined 



31 



24 



16 



Not used 



_l I L_ 



10 bit integer 



14 bit fraction 



_i — i — i — i — i — i — 1_ 



-I 1 L 



Sign 



Not used 



Initial scanline in trapezoid filling, or initial Y position for line drawing. The value 
is in 2s-complement 11.14 fixed-point format. 



7-108 



StatisticMode mnemonic 



StatisticMode 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Statistic Mode 
Host Out 
0x0181 
Undefined 



31 



24 



16 



-i — i — i — i — r 
Reserved 



-i — i — i — i — r 



_i i i i_ 



_i — i — i — i — 1_ 



_i — i — i — i — 1_ 



Include Spans 
Compare Function 
Monitor Culled Fragments 

Monitor Pixels Written 




Controls the mode of statistics collection. 



BitO 



Bit1 



Bit2 



EnableStats: 



StatsType: 



Active Steps: 



Bit3 



Passive Steps: 



= Disable Statistics collection 

1 = Enable Statistics collection 



= Picking mode 

1 = Extent collection 



= Excludes Pixels that were 
drawn 

1 = Includes Pixels that were 
drawn 

= Excludes fragments that were 
culled from being drawn 

1 = Includes fragments that were 
culled from being drawn 



Bit4 



CompareFunction: 

= Inside region 

1 = Outside region 



Graphics Register Reference 7-109 



mnemonic StatisticMode 



Bit5 Spans: 

= Exclude block filled spans 

1 = Include block filled spans 



7-110 



Stencil mnemonic 



Stencil 

Name: Stencil 
Unit: Stencil/Depth 
Tag: 0x0133 
Reset Value: Undefined 
Read/write 

31 24 16 8 

1 1 i 1 i i i 1 1 1 1 1 1 1 1 ■ 1 1 1 1 1 1 1 ■ 1 1 1 1 1 1 1 

Reserved 

i i i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — ■ — i — i — i — i — i — i — J — 

Stencil 

The stencil value to be used in clearing down the stencil buffer, or in drawing 
a primitive where the host supplies the stencil value. 



Graphics Register Reference 



7-111 



mnemonic StencilData 



Stencil Data 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Stencil Data 
Stencil/Depth 
0x0132 
Undefined 



31 


24 


16 




8 







i i i i i i i ■ i i i i i i 
Reserved 




Reserved 




Reserved 





Reference Stencil 



Write Mask Compare Mask 

Holds data used in the stencil test. 



The stencil writemask controls which stencil planes are updated as a result of 
the test. 



BitO 



Bit8 



Bit1 6 



Reference Stencil is the reference value for the 
stencil test. 

Compare Mask is the mask used to determine 
which bits are significant in the comparison. 

Stencil Writemask is the mask used to determine 
which bits in the localbuffer are updated. 



7-112 



StenciiMode mnemonic 



StencilMode 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Stencil Mode 
Stencil/Depth 
0x0131 
Undefined 



31 


24 


16 






8 









i i i 


i i i ■ i i i i 

Reserved 


-i — i — i — ■ — i 


i 

src 


i i 
tunc 


1 ■ ! 

sfail 


i i 
dpfail 


i i 
dppass 





Stencil source 



Unsigned compare function 

Update Method 



Unit enable 



Controls the stencil test, which conditionally rejects fragments based on the 
outcome of a comparison between the value in the stencil buffer and a 
reference value in the StencilData register. If the test is LESS and the result 
is true then the fragment value is less than the source value. 



BitO 



Bit1-3 



Bit4-6 



Bit7-9 



Mode Method 



Unit Enable: 



= Disable 

1 = Enable 



Update Method if Depth test passes and Stencil 
test passes: (see table below) 

Update Method if Depth test fails and Stencil test 
passes: (see table below) 

Update Method if Stencil test fails: 



Result 






Keep 


Source stencil 


1 


Zero 





2 


Replace 


Reference stencil 


3 


Increment 


Clamp (Source stencil + 1) to 2 stencil width - 1 


4 


Decrement 


Clamp (Source stencil -1 ) to 


5 


Invert 


-Source stencil 
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mnemonic stenciiMode 



Bit1 0—1 2 Unsigned Comparison Function: 

Mode = Comparison Function 

= NEVER 

1 = LESS 

2 = EQUAL 

3 = LESS OR EQUAL 

4 = GREATER 

5 = NOT EQUAL 

6 = GREATER OR EQUAL 

7 = ALWAYS 

Bit1 3-1 4 Stencil Source: 

= Test Logic 

1 = Stencil Register 

2 = LBData 

3 = LBSourceData 
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SuspendUntilFrameblank mnemonic 



SuspendUntilFrameblank 

Name: Suspend until frameblank 

Unit: Framebuffer R/W 

Tag: 0x01 8F 

Reset Value: Undefined 
Write 

31 24 16 



— i — i — i ■ — i 1 — r 

32 bit integer address 



This command causes all outstanding framebuffer writes to be flushed and 
then suspension of framebuffer accesses until the next frameblank period. 
The data field is the start address of the next frame to be displayed. This 
address will be used from the next frameblank until a new address is supplied. 

BitO-31 Address 
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7-115 



mnemonic Sync 



Sync 



Name: 

Unit: 

Tag: 

Reset Value: 
Write 



Synchronization 
Host Out 
0x0188 
Undefined 



31 24 16 8 

1 1 i 1 i i i 1 1 1 1 1 1 1 1 ■ 1 1 1 1 1 1 1 ■ 1 1 1 1 1 1 1 

31 user defined bits 

i i i i i i i — ■ — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — 

Interrupt enable 



This command can be used to synchronize the TVP401 with the host. It is 
also used to flush outstanding the TVP4010 operations such as pending 
memory accesses. It also causes the current status of the picking result to be 
passed to the Host Out FIFO, unless culled by the statistics bits in the Filter- 
Mode register. 

BitO-30 User Defined 

Bit31 InterruptEnable: 

= Disable Interrupt for this 
command 

1 = Enable Interrupt for this 
command 

The data output is the value written to the register by this command. If 
interrupts are enabled, then the interrupt does not occur until the tag and/or 
data have been written to the output FIFO. 

The corresponding tag value output is: 0x188 
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Texeio mnemonic 



TexelO 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Texel Value 
Texture/Fog/Blend 
OxOOCO 
Undefined 



31 




24 




16 




8 







1 1 1 1 1 1 1 

Alpha 


i i i i i i i 
Blue 


i i i i i i i 
Green 


i i i i i i i 
Red 


31 




24 




16 




8 







■ i i i i i i 
Alpha 


V 


U 


Y 



The texel value can be loaded using the Rasterizer SyncOnHostData mode. 
This is useful for direct application of procedural textures. It is also used when 
downloading YUV data which needs to be converted to RGB; the YUV 
conversion is done on the contents of this register. 

This register is also used to supply the background color if Force Background- 
Color has been enabled in either the RasterizerMode or the AreaStippleMode 
registers. 
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mnemonic TexeiLUT(0..i5) 



TexelLUT(0..15) 

Name: Texel LUT entries to 1 5 

Unit: Texture Read 

Tag: 0x01 D0,B,0x1 DF 

Reset Value: Undefined 
Read/write 



31 24 




16 




8 







■ i iii iii 
Not used 


i i i i 
Blue 


Not used 


i i i i 
Green 


Not used 


Red 


i i 
Not used 



The value to be loaded into the specified texel look-up-table entry. 
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TexeiLUTMode mnemonic 



TexelLUTMode 



Name: Texel LUT Mode 

Unit: Texture Read 

Tag: OxOOCF 

Reset Value: Undefined 
Read/write 



31 24 16 

— i 1 i — i — i — i i — ■ — i 1 — i 1 — i 1 — i ■ — i r 

Reserved 



-J 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ■ 1 1 1 1 1 u 



Enable 



Specifies the organization of the texture map in memory. 
BitO Enable: 

= No 

1 = Lookup 
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mnemonic TextureAddressMode 



TextureAddressMode 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Texture Address Mode 
Texture Address 
0x0070 
Undefined 



31 



24 



16 



I I I 



_l 1 1 1_ 



Reserved 



Delta Format 

Fast 

Perspective Correction 

Enable unit 



Controls the calculation of texture addresses. 

If bit 1 is set and bit2 is not, the TVP4010 performs accurate perspective 
correction. If both bits are set, the TVP4010 performs fast perspective 
correction. If both bits are cleared perspective correction is disabled. 

Note that undesirable results may occur if bit 2 is set but bit 1 is not. 

Note that the TextureEnable bit in the Render command must also be set for 
addresses to be generated. 



BitO 



Bit1 



Bit2 



Enable: 



= Disable 

1 = Enable 



Perspective Correction: 

= Disable 

1 = Enable 



Fast: 



= Disable 

1 = Enable fast perspective 
correction (if Bit1 is also set) 
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TextureAddressMode mnemonic 



Bit3 Delta Format - Allows the TVP401 to use texture 

delta values generated by GLINT Delta 

= Disable 

1 = Enable 
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mnemonic TextureBaseAddress 



TextureBaseAddress 



Name: Texture Base Address 

Unit: Texture Read 

Tag: OxOOBO 

Reset Value: Undefined 
Read/write 



31 24 16 8 

t — i — i — i — i — i — i ■ — i — i — i — i — i — i — i — r 



reserved 



24 bit unsigned integer 



Base address of texture map. Specified in texels from the base of memory. 
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TextureColorMode mnemonic 



TextureColorMode 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Texture Color Mode 
Texture/Fog/Blend 
OxOODO 
Undefined 



31 



24 



16 



i I i i i 



_l 1 1 1_ 



Reserved 



KsDDA 

KdDDA 



Texture Type 
Application Mode 

Texture Enable 



Controls the application of texture. The KsDDA and KdDDA bits enable the 
internal DDAs and should be set for modulate or highlight Ramp texture 
application modes. The Texture Type field differentiates between Ramp and 
RGB application modes. With Ramp Application Mode, various modes can be 
simulataneously applied e.g., decal with highlight. 

Note that the TextureEnable bit in the Render command must also be set for 
a primitive to be texture mapped. 



BitO 



Bit1-3 



Texture Enable: 

= Disable 

1 = Enable texture application 

Application Mode: 
RGB 

= Modulate 

1 = Decal 

2 = Reserved 

3 = Copy 



Ramp 

Bit 1 = Decal 
Bit 2 = Modulate 
Bit 3 = Highlight 



Bit4 



Bit5 



Texture Type: 



KdDDA: 




1 



1 



RGB 
Ramp 

Disable 
Enable 
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mnemonic TextureColorMode 



Bit6 KsDDA: 

= Disable 

1 = Enable 
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TextureData mnemonic 



TextureData 



Name: Texture Data 

Unit: Framebuffer R/W 

Tag: 0x01 1D 

Reset Value: Undefined 
Write 



31 24 16 

— i 1 i — i — i — i i — ■ — i 1 — i 1 — i 1 — i ■ — i r 



Data 

-J 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 L. 



Used with TextureDownloadOffset to load raw texture data into memory. This 
may include multiple texels depending on the texel size. 

BitO-31 Data 
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mnemonic TextureDataFormat 



TextureDataFormat 

Name: Texture Data Format 

Unit: Texture Read 

Tag: 0x00B2 

Reset Value: Undefined 
Read/write 



31 24 16 8 



I I 1 I 1 1 1 1 I I I 1 I I I ■ I I I I I I I ■ 

Reserved 








i i i 


Texture Format Extension J j 
Color Order 
No Alpha Buffe 


r 





Texture Format 

Specifies the color format of the texture map in memory. 

See the register outline above for a description of the bit fields. 
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TextureDataFormat mnemonic 



BitO-3 Texture Format: 



Internal Color Channel 



Format 3 


Color Order 


Name 


R/Y 


G/U 


B/V 


A 





BGR 


8:8:8:8 


8@0 


8@8 


8@16 


8@24 


1 


BGR 


5:5:5:1 Front 


5@0 


5@5 


5@10 


1@15 


2 


BGR 


4:4:4:4 


4@0 


4@4 


4@8 


4@12 


5 


BGR 


3:3:2 Front 


3@0 


3@3 


2@6 





6 


BGR 


3:3:2 Back 


3@8 


3@11 


2@14 





9 


BGR 


2:3:2:1 Front 


2@0 


3@2 


2@5 


1@7 


10 


BGR 


2:3:2:1 Back 


2@8 


3@10 


2@13 


1@15 


11 


BGR 


2:3:2 FrontOff 


2@0 


3@2 


2@5 





12 


BGR 


2:3:2 BackOff 


2@8 


3@10 


2@13 





13 


BGR 


5:5:5:1 Back 


5@16 


5@21 


5@26 


1@31 


14 


BGR 


CI8 


8@0 











15 


BGR 


CI4 


4@0 











16 


BGR 


5:6:5 Front 


5@0 


6@5 


5@11 





17 


BGR 


5:6:5 Back 


5@16 


6@21 


5@27 





18 


BGR 


YUV411 


8@0 


8@8 


8@16 


8@24 


19 


BGR 


YUV422 


8@0 


8@8 


8@8 








RGB 


8:8:8:8 


8@16 


8@8 


8@0 


8@24 


1 


RGB 


5:5:5:1 Front 


5@10 


5@5 


5@0 


1@15 


2 


RGB 


4:4:4:4 


4@8 


4@4 


4@0 


4@12 


5 


RGB 


3:3:2 Front 


3@5 


3@2 


2@0 





6 


RGB 


3:3:2 Back 


3@13 


3@10 


2@8 





9 


RGB 


2:3:2:1 Front 


2@5 


3@2 


2@0 


1@7 


10 


RGB 


2:3:2:1 Back 


2@13 


3@10 


2@8 


1@15 


11 


RGB 


2:3:2 FrontOff 


2@5 


3@2 


2@0 





12 


RGB 


2:3:2 BackOff 


2@13 


3@10 


2@8 





13 


RGB 


5:5:5:1 Back 


5@26 


5@21 


5@16 


1@31 


14 


RGB 


CI8 


8@0 











15 


RGB 


CI4 


4@0 











16 


RGB 


5:6:5 Front 


5@11 


6@5 


5@0 






3) The format column is also dependant on bit1 6. n@m means n bits starting at bit m. Front and Back modes 
replicate the color value to assist with double buffering. CI values are replicated into each byte to assist with 
double buffering. Offset modes have 64 added to the 7 bit formatted value. If the format has no alpha bits, the 
alpha field defaults to 0xF8. 
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mnemonic TextureDataFormat 



BitO-3 Texture Format: 



Internal Color Channel 


Format 4 


Color Order 


Name 


R/Y 


G/U 


B/V 


A 


17 


RGB 


5:6:5 Back 


5@27 


6@21 


5@16 





18 


RGB 


YUV411 


8@16 


8@8 


8@0 


8@24 


19 


RGB 


YUV422 


8@8 


8@0 


8@0 






4) The format column is also dependant on bit16. n@m means n bits starting at bit m. Front and Back modes 
replicate the color value to assist with double buffering. CI values are replicated into each byte to assist with 
double buffering. Offset modes have 64 added to the 7 bit formatted value. If the format has no alpha bits, the 
alpha field defaults to 0xF8. 



Bit4 No Alpha Buffer: 

= Alpha buffer present 

1 = Alpha buffer not present 

Bit5 Color Order: 

= BGR 

1 = RGB 

Bit6 Texture Format Extension. Most significant bit 

extension to Texture Format held in bitsO-3. 
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TextureDownloadOffset mnemonic 



TextureDownloadOffset 



Name: 

Unit: 

Tag: 

Reset Value: 
Write/Read 



Texture Download Offset 
Framebuffer R/W 
0x011 E 
Undefined 



31 



24 



16 



8 



-i 1 — i 1 1 — i 1 — ■ — i 1 r 

22-bit unsigned integer address 



Reserved 



This is the 32-bit aligned address at which the texture load will start. Each write 
to TextureData increments this value by one after the store has taken place. 
Note, if this register is read back it will not necessarily contain the same value 
as the written value. 



BitO-21 



Address 
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mnemonic TextureMapFormat 



TextureMapFormat 



Name: Texture Map Format 

Unit: Texture Read 

Tag: OxOOBI 

Reset Value: Undefined 
Read/write 



31 


24 


16 




8 







■ i i i i i i i i i 
Reserved 










i — i — i — i — i — i — i — i 
Reserved 


PP2 


PP1 


I 1 1 

PPO 




Partial product selection 

Texel Size' 

Reserved \ Window origin 
SubPatch mode 

Specifies the organization of the texture map in memory. 

Enabling subpatch addressing improves the performance of texture mapping 
in typical situations. 



BitO-2 



Bit3-5 



Bit5-7 



Bit1 6 



Bit1 7 



Partial Product - See Appendix C for a table of 
values 

Partial Product 1 - See Appendix C for a table of 
values 

Partial Product 2 - See Appendix C for a table of 
values 



Window Origin: 



= Top 

1 = Bottom Left 



Subpatch Mode: 



= Disable 

1 = Enable 
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TextureMapFormat mnemonic 



Bit1 9-20 



Texel Size: 



= 8 bits 

1 = 16-bits 

2 = 32 bits 

3 = 4 bits 
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mnemonic TextureReadMode 



TextureReadMode 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Texture Read Mode 
Texture Read 
OxOOCE 
Undefined 



31 


24 




16 




8 





■ i i i i i 
Reserved 




Reserved 




i — i — i — i — i 
Height 


i i i 
Width 


■ i i 

Reserved 









Packed Data 



Filter Mode 



TWrapMode 

SWrapMode 

Enable 



Controls texture read operations. When FilterMode is set, bilinear texture 
mapping is performed otherwise nearest neighbor texture mapping occurs. 
The S and TWrapModes specify the action to be taken when the S and T 
coordinates fall outside the required range. Clamp is useful when texture 
mapping a single image onto an object, Repeat cause the texture pattern to 
be repeated, whilst mirror causes the texture pattern to be alternately 
reversed. The Packed Data bit is used to define how texels are read from 
memory. If this bit is cleared, each texel is read one at a time; if set several 
texels can be read simultaneously improving efficiency. The actual number of 
texels read in this case is dependant on the texel size. 



BitO 



Bit1 -2 



Bit3-4 



Enable 



SWrapMode 



TWrapMode 



= Disable texture reads 

1 = Enable texture reads 



= Clamp 



1 

2 



Repeat 
Mirror 



= Clamp 

1 = Repeat 

2 = Mirror 
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TextureReadMode mnemonic 



Bit9-12 
Bit13— 16 
Bit17 



Width - log2 texture map width 
Height - log2 texture map height 
FilterMode 



Bit24 



PackedData 



= Disable bilinear texture 
filtering 

1 = Enable bilinear texture 
filtering 

= off 

1 = on 
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mnemonic TStart 



TStart 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Initial texture T value 
Texture Address 
0x0074 
Undefined 



31 


24 


16 




8 







■ i i i i i ■ i i i 




I I I I 


i ■ i i i i i 






Integer 




Fraction 



















Sign 



Reserved 



Used to set the initial value for the T coordinate when texture mapping. Format 
is 2s-complement 12.18 fixed-point. 
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WaitForCompietion mnemonic 



WaitForCompletion 



Name: Wait for completion 

Unit: Rasterizer 
Tag: 0x0017 
Reset Value: Undefined 
Write 

31 24 16 



- — i i — ■ — i 1 — i 1 — i 1 — i 1 — i r 

Reserved 



This command register causes the TVP4010 to suspend operation until all 
framebuffer writes have completed. Useful to separate, say, a texture 
download from subsequent primitives. 

BitO-31 Reserved 
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mnemonic Window 



Window 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Window 
Stencil/Depth 
0x0130 
Undefined 



31 



24 



16 



l i I i i i I 

Reserved 
_i i i i i i — i 



Reserved 



—I 1 1 1 1 i_ 



Disable LB Update 



LB UpdateSource 

Force LB Update 



Reserved 



If the Force LB Update bit is set, this overrides the stencil and depth tests, and 
the per unit enables, to force the localbuffer to be updated. Writes must still be 
enabled in the LBWriteMode register. When this bit is clear, any update is 
conditional on the outcome of the stencil and depth tests. 

If the Disable LB Update bit is set, the results of the stencil and depth tests are 
overridden and the localbuffer not updated, even if localbuffer writes are 
enabled. When writes are disabled in LBWriteMode there may be a 
performance advantage in also setting Disable LB Update. 



Bit3 



Bit4 



Bit1 8 



Force LB Update: 

= Not Forced 

1 = Forced 

LB Update Source: 

= LBSourceData 

1 = Registers 

Disable LB Update 

= enabled 

1 = disabled 
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WindowOrigin mnemonic 



WindowOrigin 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Window Origin 
Scissor/Stipple 
0x0039 
Undefined 



31 


24 


16 




8 





i i i 
Not used 


i i i ■ i i i i i i i i 
12-bit 2s-complement Y 


i i i 
Not used 


i i i ■ i i i i i i i 
12-bit 2s-complement X 



As each fragment is generated by the Rasterizer unit, this origin is added to 
the coordinates of the fragment to generate its screen coordinates. This 
occurs prior to doing the screen scissor test. 
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mnemonic XLimits 



XLimits 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



X extent for rasterizing 

Rasterizer 

0x0019 

Undefined 



31 


24 16 




8 





■ i i 
Not used 


i i i ■ i i i i i i i 
12-bit 2s-complement X Max 


Not used 


12-bit 2s-complement X Min 



Defines the X extent the Rasterizer should fill between. 
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YLimits mnemonic 



YLimits 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Y extent for rasterizing 

Rasterizer 

0x0015 

Undefined 



31 


24 16 




8 





i i i 
Not used 


i i i ■ i i i i i i i i 
12-bit 2s-complement Y Max 


i i i 
Not used 


i i i ■ i i i i i i i 
12-bit 2s-complement Y Min 



Defines the Y extent the Rasterizer should fill between. 
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mnemonic YUVMode 



YUVMode 

Name: YUV Mode 

Unit: YUV 
Tag: 0x01 EO 

Reset Value: Undefined 
Read/write 

31 24 16 8 

1 1 i 1 i i i 1 1 1 1 1 1 1 1 ■ 1 1 1 1 1 1 1 ■ 1 1 1 1 1 1 

Reserved 

i i i i i i i — ■ — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — ■ — i — I — I — I — i — i — I — 

TexelDisableUpdate 

RejectTexel 

Test Data 

TestMode 

Enable 



Control YUV to RGB conversion and/or chroma test. 



BitO 



Enable 



Bit1 -2 



TestMode 



Bit3 



TestData 



= YUV to RGB color space 
conversion disabled 

1 = YUV to RGB color space 
conversion enabled 



= No chroma test 

1 = Pass if within chroma bounds 

2 = Fail if within chroma bounds 



= Apply chroma test on input 

data (before color space 
conversion if enabled) 

1 = Apply chroma test on output 

data (after color space 
conversion if enabled) 
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YUVMode mnemonic 



Bit4 RejectTexel 

= Do not plot pixel if chroma test 

fails 

1 = Do not texture pixel if chroma 

test fails 

Bit5 Texel Disable Update 

= Pass on texel data 

1 = Reject texel data immediately 

after chroma test 
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mnemonic zstartL 



ZStartL 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Depth Start Value - Lower 

Stencil/Depth 

0x0137 

Undefined 



31 24 16 8 



1 1 1 1 1 1 1 I 1 1 

11 bit fraction 




i i i i i 
Not used 


i i i i i i i i 











This register holds part of the start value for depth interpolation. ZStartU holds 
the most significant bits, and ZStartL the least significant bits. The combined 
value is in 2s-complement 17.11 fixed-point format. 
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zstartu mnemonic 



ZStartU 



Name: 

Unit: 

Tag: 

Reset Value: 
Read/write 



Depth Start Value - Upper 

Stencil/Depth 

0x0136 

Undefined 



31 


24 


16 


8 





i i i 


i i i i i i i i i i i 




i i i i i i i i i i i i 


i i i 




Not Used 




16-bit integer 




i i i_ 


i i i — i — i — i — i i — i — i — i — 




— i — i — i — i — i — i — i — ■ — i — i — i — i — 





I 



Sign 

This register holds part of the start value for depth interpolation. ZStartU holds 
the most significant bits, and ZStartL the least significant bits. The combined 
value is in 2s-complement 17.11 fixed-point format. 
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Chapter 8 



Register Tables 



The following tables list registers by: unit, name and register address, giving 
their tag values and indicating their type. The register groups may be used to 
improve data transfer rates to the TVP4010 when using DMA. 

The following types of register are distinguished: 

Control: Set state and control bits ready to draw a 
primitive. This is the default and is indicated by a 
blank entry in the "Type" column. 

Command: Initiates some operation e.g. drawing of a 
primitive . 

Mixed: A control register which may also be used to 
supply successive data values during download. 

Output: An internal register that cannot be read or 
written, but whose contents is passed to the Host Out 
FIFO under the control of certain commands. 

In addition, the table indicates whether the register can be read back. A blank 
entry in this column indicates that the register contents cannot be read back. 
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Register Tables 



Table 8-1. Registers by Unit 



Unit 


Register 


Major Group 
(hex) 


Offset 
(hex) 


Type Readable 


Rasterizer 


StartXDom 


00 





• 




dXDom 


00 


1 


• 




StartXSub 


00 


2 


• 




dXSub 


00 


3 


• 




StartY 


00 


4 


• 




dY 


00 


5 


• 




Count 


00 


6 


• 




Render 


00 


7 


Command 




ContinueNewLine 


00 


8 


Command 




ContinueNewDom 


00 


9 


Command 




ContinueNewSub 


00 


A 


Command 




Continue 


00 


B 


Command 




BitMaskPattern 


00 


D 


Mixed 




RasterizerMode 


01 


4 


• 




YLimits 


01 


5 


• 




WaitForCompletion 


01 


7 


Command 




XLimits 


01 


9 


• 




PackedDataLimits 


02 


A 


• 


Scissor/Stipple 


ScissorMode 


03 





• 




ScissorMinXY 


03 


1 


• 




ScissorMaxXY 


03 


2 


• 




ScreenSize 


03 


3 


• 




AreaStippleMode 


03 


4 


• 




WindowOrigin 


03 


9 


• 




AreaStipplePattern(0-7) 


04 


0-7 




LB Read/Write 


LBReadMode 











LBReadFormat 




1 






LBSourceOffset 




2 






LBData 




3 






LBStencil 




5 


Output 




LBDepth 




6 


Output 




LBWindowBase 




7 


• 




LBWriteMode 




8 


• 
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Register Tables 



Table 8-1. Registers by Unit (Continued) 

Unit Register Major Group Offset Type Readable 



(hex) (hex) 

LBWriteFormat 11 9 

Stencil/Depth Window 13 

StencilMode 13 1 

StencilData 13 2 

Stencil 13 3 Mixed 

DepthMode 13 4 

Depth 13 5 Mixed 

ZStartU 13 6 

ZStartL 13 7 

dZdxU 13 8 

dZdxL 13 9 

dZdyDomU 13 A 

dZdyDomL 13 B 

Texture Address TextureAddressMode 07 

SStart 07 1 

dSdx 07 2 

dSdyDom 07 3 

TStart 07 4 

dTdx 07 5 

dTdyDom 07 6 

QStart 07 7 

dQdx 07 8 

dQdyDom 07 9 

Texture Read TextureBaseAddress 0B 

TextureMapFormat 0B 1 

TextureDataFormat 0B 2 

TexelO 0C 

TextureReadMode 0C E 

TexelLUTMode 0C F 

TexelLUT(0-1 5) 1 D 0-F 

YUV YUVMode 1E 

ChromaUpperBound 1E 1 

ChromaLowerBound 1E 2 
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Register Tables 



Table 8-1. Registers by Unit (Continued) 

Unit Register Major Group Offset Type Readable 



(hex) (hex) 

FBRead/Write FBReadMode 15 

FBSourceOffset 15 1 • 

FBPixelOffset 15 2 

FBColor 15 3 Output 

FBData 15 4 Mixed 

FBSourceData 15 5 Mixed 

FBWindowBase 15 6 • 

FBWriteMode 15 7 

FBHardwareWriteMask 15 8 • 

FBBIockColor 15 9 • 

FBReadPixel 15 A • 

TextureData 11 D 

TextureDownloadOffset 11 E • 

Color DDA RStart OF • 

dRdx OF 1 

dRdyDom OF 2 • 

GStart OF 3 

dGdx OF 4 

dGdyDom OF 5 • 

BStart OF 6 

dBdx OF 7 

dBdyDom OF 8 • 

AStart OF 9 

ColorDDAMode OF C • 

ConstantColor OF D • 

Color OF E Mixed 

Texture/Fog/Blend TextureColorMode OD • 

FogMode OD 2 • 

FogColor OD 3 • 

FStart OD 4 

dFdx OD 5 

dFdyDom OD 6 • 

KsStart OD 9 
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Register Tables 



Table 8-1. Registers by Unit (Continued) 



Unit 


Register 


Major Group 
(hex) 


Offset 
(hex) 


Type Readable 




dKsdx 


OD 


A 


• 




dKsdyDom 


OD 


B 


• 




KdStart 


OD 


C 


• 




dKddx 


OD 


D 


• 




dKddyDom 


OD 


E 


• 




AlphaBlendMode 


10 


2 


• 


Color Format 


DitherMode 


10 


3 


• 


Logical Ops 


FBSoftwareWriteMask 


10 


4 


• 




LogicalOpMode 


10 


5 


• 




FBWriteData 


10 


6 


• 


Host Out 


FilterMode 


18 





• 




StatisticMode 


18 


1 






MinRegion 


18 


2 






MaxRegion 


18 


3 






ResetPickResult 


18 


4 


Command 




MinHitRegion 


18 


5 


Command 




MaxHitRegion 


18 


6 


Command 




PickResult 


18 


7 


Command • 




Sync 


18 


8 


Command 




SuspendUntilFrameBlank 


18 


F 


Command 
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Table 8-2. Registers by Name 



Register 


Major Group 
(hex) 


Offset 
(hex) 


Type 


Readable 


AlphaBlendMode 


10 


2 




• 


AreaStippleMode 


03 


4 




• 


AreaStipplePattern(0-7) 


04 


0-7 




• 


AStart 


OF 


9 




• 


BitMaskPattern 


00 


D 


Mixed 




BStart 


OF 


6 




• 


ChromaLowerBound 


1E 


2 




• 


ChromaUpperBound 


1E 


1 




• 


Color 


OF 


E 


Mixed 




ColorDDAMode 


OF 


C 




• 


ConstantColor 


OF 


D 




• 


Continue 


00 


B 


Command 




ContinueNewDom 


00 


9 


Command 




ContinueNewLine 


00 


8 


Command 




ContinueNewSub 


00 


A 


Command 




Count 


00 


6 




• 


dBdx 


OF 


7 




• 


dBdyDom 


OF 


8 




• 


Depth 


13 


5 


Mixed 


• 


DepthMode 


13 


4 




• 


dFdx 


0D 


5 




• 


dFdyDom 


0D 


6 




• 


dGdx 


OF 


4 




• 


dGdyDom 


OF 


5 




• 


DitherMode 


10 


3 






dKddx 


OD 


D 






dKddyDom 


OD 


E 






dKsdx 


OD 


A 






dKsdyDom 


OD 


B 






dQdx 


07 


8 






dQdyDom 


07 


9 






dRdx 


OF 


1 






dRdyDom 


OF 


2 
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Table 8-2. Registers by Name (Continued) 



Register Major Group Offset Type Readable 

(hex) (hex) 



HQrlv 
QouX 


U/ 


O 
c. 




abuyuom 


U/ 


o 
O 




a i ox 


u/ 


c 
O 




dTdyDom 


U/ 


b 




aAUom 


UU 


A 
1 




dAbUD 


UU 


o 
O 




dr 


UU 


cr 

o 




az.axL 


1 o 


y 




dzaxu 


1 O 


o 
o 




dZdyDomL 


1 O 


D 




az.ayiJomu 


1 O 


A 

A 




rbDlOCKOOlOr 


1 


y 




rbooior 


1 


o 
O 


(~\ . , + — , ,4. 

(Jutput 


FBData 


1 


4 


Mixed 


FBHardwareWritGMask 


1 


o 
o 




rDr IXeivJTTSGl 


1 


o 
c. 




rDneaa ivioue 


1 


U 




1 |~) 1 \ - nrJDiwnl 

rDrieaurlXGI 


1 


A 

A 




rDboTtwarevvnteMasK 


1 u 


4 




rBbourceuata 


1 





Mixed 


rbbourceUTTset 


1 






rB window/Base 


1 5 


6 




rDwriTeuaia 


I u 


c 
u 




FBWriteMode 


15 


7 




FilterMode 


18 







FogColor 


OD 


3 




FogMode 


OD 


2 




FStart 


OD 


4 




GStart 


OF 


3 




KdStart 


OD 


C 




KsStart 


OD 


9 




LBData 


11 


3 




LBDepth 


11 


6 


Output 
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Table 8-2. Registers by Name (Continued) 



Register Major Group Offset Type Readable 

(hex) (hex) 



LDnGaarorrnai 




H 
\ 




LDnGaQIVIOQG 




A 
U 




LBbourceUTrset 


! ' 


d. 




1 DOfnnnil 

LBbtencn 


[\ 





/~\ 1 i4-in 1 1 + 

uutput 


LBWi ndowBase 


11 


-7 

( 




LBWriterormat 




n 

y 




LBwriteMoae 










LogicalOpMode 


4 a 


r- 





MaxHitRegion 


1 


c 




Command 


MaxRegion 


^ 
1 O 









MinHitRegion 


^ 
1 O 





Command 


MinRegion 


^ 
1 O 


d. 




Packed DataLimits 


AO 


A 

A 




rICKrieSUIt 


^ 
1 O 


-7 

/ 


Command • 


Uotart 


A"7 

U/ 


-7 

( 




RasterizerMode 


Ul 


A 

4 




Render 


AA 
UU 


-7 

( 


Command 


ResetPickResult 


H O 
1 O 


A 

4 


Command 


RStart 


Atr 
Ur 


U 




ScissorMaxXY 


AO 


d. 




ScissorMinXY 


AO 

Uo 


A 
\ 




ocissorivioue 


Uo 


A 
U 




ocreenoize 


Uo 


Q 




SStart 


07 


1 




StartXDom 


00 







StartXSub 


00 


2 




StartY 


00 


4 




StatisticMode 


18 


1 




Stencil 


13 


3 


Mixed • 


StencilData 


13 


2 




StencilMode 


13 


1 




SuspendUntilFrameBlank 


18 


F 


Command 


Sync 


18 


8 


Command 
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Table 8-2. Registers by Name (Continued) 


Register 


Major Group 
(hex) 


Offset 
(hex) 


Type 


Readable 


TexelO 


OC 







• 


TexelLUT(0-15) 


1D 


0-F 




• 


TexelLUTMode 


OC 


F 




• 


TextureAddressMode 


07 







• 


Textu re BaseAdd ress 


OB 







• 


TextureColorMode 


0D 







• 


TextureData 


11 


D 






TextureDataFormat 


OB 


2 




• 


Textu re Down loadOff set 


11 


E 




• 


TextureMapFormat 


OB 


1 




• 


TextureReadMode 


OC 


E 




• 


TStart 


07 


4 






WaitForCompletion 


01 


7 


Command 




Window 


13 









WindowOrigin 


03 


9 






XLimits 


01 


9 






YLimits 


01 


5 






YUVMode 


1E 









ZStartL 


13 


7 






ZStartU 


13 


6 






Table 8-3. Registers by Address 


Register 


Major Group 
(hex) 


Offset 
(hex) 


Type 


Readable 


StartXDom 


00 









dXDom 


00 


1 






StartXSub 


00 


2 






dXSub 


00 


3 






StartY 


00 


4 






dY 


00 


5 






Count 


00 


6 






Render 


00 


7 


Command 




ContinueNewLine 


00 


8 


Command 
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Table 8-3. Registers by Address (Continued) 



Register 


Major Group 
(hex) 


Offset 
(hex) 


Type 


Readable 


ContinueNewDom 


00 


9 


Command 




ContinueNewSub 


00 


A 


Command 




Continue 


00 


B 


Command 




BitMaskPattern 


00 


D 


Mixed 




RasterizerMode 


01 


4 






YLimits 


01 


5 






WaitForCompletion 


01 


7 


Command 




XLimits 


01 


9 






Packed DataLimits 


02 


A 




* 


ScissorMode 


03 







* 


ScissorMinXY 


03 


1 






ScissorMaxXY 


03 


2 






ScreenSize 


03 


3 




* 


AreaStippleMode 


03 


4 






WindowOrigin 


03 


9 






AreaStipplePattern(0-7) 


04 


0-7 




* 


TextureAddressMode 


07 









SStart 


07 


1 






dSdx 


07 


2 






dSdyDom 


07 


3 






TStart 


07 


4 




* 


dTdx 


07 


5 




* 


dTdyDom 


07 


6 




* 


QStart 


07 


7 






dQdx 


07 


8 






dQdyDom 


07 


9 






TextureBaseAddress 


0B 









TextureMapFormat 


0B 


1 






TextureDataFormat 


0B 


2 






TexelO 


OC 









TextureReadMode 


OC 


E 






TexelLUTMode 


OC 


F 
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Table 8-3. Registers by Address (Continued) 



Register Major Group Offset Type Readable 

(hex) (hex) 



TextureColorMode 


OD 





* 


FogMode 


OD 


2 


* 


FogColor 


OD 


3 


* 


FStart 


OD 


4 


* 


dFdx 


OD 


5 


• 


dFdyDom 


OD 


6 


* 


KsStart 


OD 


9 


* 


dKsdx 


OD 


A 


• 


dKsdyDom 


OD 


B 


* 


KdStart 


OD 


C 


* 


dKddx 


OD 


D 


* 


dKddyDom 


OD 


E 


• 


RStart 


OF 





* 


dRdx 


OF 


1 


• 


dRdyDom 


OF 


2 


* 


GStart 


OF 


3 


* 


dGdx 


OF 


4 


• 


dGdyDom 


OF 


5 


• 


BStart 


OF 


6 


* 


dBdx 


OF 


7 


• 


dBdyDom 


OF 


8 


• 


AStart 


OF 


9 


* 


ColorDDAMode 


OF 


C 




ConstantColor 


OF 


D 




Color 


OF 


E 


Mixed 


AlphaBlendMode 


10 


2 




DitherMode 


10 


3 




FBSoftwareWriteMask 


10 


4 




LogicalOpMode 


10 


5 




FBWriteData 


10 


6 




LBReadMode 


11 







LBReadFormat 


11 


1 




LBSourceOffset 


11 


2 
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Table 8-3. Registers by Address (Continued) 



Register Major Group Offset Type Readable 

(hex) (hex) 



LBData 


11 


3 




LBStencil 


11 


5 


Output 


LBDepth 


11 


6 


Output 


LBWindowBase 


11 


7 


• 


LBWriteMode 


11 


8 


• 


LBWriteFormat 


11 


9 


• 


TextureData 


11 


D 




TextureDownloadOffset 


11 


E 


• 


Window 


13 





• 


StencilMode 


13 


1 


• 


StencilData 


13 


2 


• 


Stencil 


13 


3 


Mixed • 


DepthMode 


13 


4 


• 


Depth 


13 


5 


Mixed • 


ZStartU 


13 


6 


• 


ZStartL 


13 


7 


• 


dZdxU 


13 


8 


• 


dZdxL 


13 


9 


• 


dZdyDomU 


13 


A 


• 


dZdyDomL 


13 


B 


• 


FBReadMode 


15 





• 


FBSourceOffset 


15 


1 


• 


FBPixelOffset 


15 


2 




FBColor 


15 


3 


Output 


FBData 


15 


4 


Mixed 


FBSourceData 


15 


5 


Mixed 


FBWindowBase 


15 


6 




FBWriteMode 


15 


7 




FBHardwareWriteMask 


15 


8 




FBBIockColor 


15 


9 




FBReadPixel 


15 


A 




FilterMode 


18 







StatisticMode 


18 


1 
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Table 8-3. Registers by Address (Continued) 


Register 


Major Group 
(hex) 


Offset 
(hex) 


Type 


Readable 


MinRegion 


18 


2 




• 


MaxRegion 


18 


3 




• 


ResetPickResult 


18 


4 


Command 




MinHitRegion 


18 


5 


Command 




MaxHitRegion 


18 


6 


Command 




PickResult 


18 


7 


Command 


• 


Sync 


18 


8 


Command 




SuspendUntilFrameBlank 


18 


F 


Command 




TexelLUT(0-15) 


1D 


0-F 




• 


YUVMode 


1E 







• 


ChromaUpperBound 


1E 


1 




• 


ChromaLowerBound 


1E 


2 




• 
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Appendix A 



Pseudocode Definitions 



In many areas of the document fragments of pseudocode are given, to 
describe the loading of registers. These are based on a C interface to 
TVP4010 in which each 32-bit register is represented as a C structure, 
potentially split into a series of bitfields. In an example where only a subset of 
the bitfields in a register are set, it is assumed either that a software copy of 
the register is being modified, or that the current contents of the register has 
first been read back to the host. This style has been chosen for clarity; there 
are often more efficient strategies. 

Note the constant definitions and register bitfield definitions are those used in 
the C example programs, for which header files are given in Appendix E. 



Warning: the order of loading control registers into the 
HyperPipeline has also been chosen for clarity, rather than 
effinciency. The optimal order is documented in section 3.2.3. 



Loading of a TVP4010 register is expressed as: 

register-name (value) 

When writing directly to the register file (i.e. to a FIFO) this would be 
implemented by writing "value" to the mapped-in address of the register called 
"register-name". 

Fragmentary examples are not in strict C syntax, a typical example is: 

// Sample code to rasterize a 10x10 rectangle at the 
// framebuffer origin. 

StartXDom (0) // Start dominant edge 

StartXSub (1<<16) // Start of subordinate 

dXDom (0x0) 
dXSub (0x0) 
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Pseudocode Definitions 



Count (OxA) 
YStart (0) 
dY (1«16) 

// Set up to render a trapezoid. 

render . AreaSt ippleEnable = TVP4 010_DISABLE 

render . PrimitiveType = TVP4010_TRAPEZOID 

render . FastFillEnable = TVP4 010_DISABLE 

render .FogEnable = TVP4 010_DISABLE 

render. TextureEnable = TVP4 010_DISABLE 

render . ReuseBitMask = TVP4 010_DI SABLE 

render . SyncOnBitMask = TVP4010_FALSE 

render . SyncOnHostData = TVP4010_FALSE 

Render (render) // Render the rectangle 

Code is shown in courier and comments are C++ style "//" indicating that the 
rest of the line is a comment. Any statement which ends in parenthesis is a 
register update, other statements will generally be assignments. A variable, 
say render, is of a type associated with the register being modified. This will 
usually be clear by the context and will not usually be declared as such. All the 
type definitions are in the header files. The values assigned to a register will 
be either a variable as described above, a macro i.e., TVP4010_TRUE, as 
found in the headers, or an immediate constant in C style format i.e., 0x45. In 
registers which have several fields, some of which are not relevant to a 
particular example, the field can be ignored completely or set to don't care. 
In some registers, values for fields which need to be set but are not readily 
available will typically be set as appropriate. 

In some fragments, simply a list of commands is given, e.g.: 

// Sample code to rasterize a rectangle 
StartXDom () // Start dominant edge 
StartXSub () // Start of subordinate 
dXDom ( ) 
dXSub () 
Count () 
YStart () 
dY () 

// Set-up to render an aliased trapezoid. 
Render () // Render the rectangle 
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Pseudocode Definitions 



This technique is used to simply give a feel for the registers involved in a 
particular operation and where a detailed treatment is not warranted. 

To take the address of a register, the name is used, thus this example stores 
the address of the StartXDom register in the buffer pointed to by the variable 
buf and increments the pointer: 

*buf++ = StartXDom 

To test the value of a register the register name is dereferenced using the C 
"*" operator as for instance in this example which tests for the completion of 
a DMA operation: 

while ( *DMACount != ) ; 
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Appendix B 



Screen Widths Table 



The screen width is specified as the sum of selected partial products so a full 
multiply operation is not needed. The partial products are selected by the fields 
PPO, PP1 and PP2 in the FBReadMode register, LBReadMode register and 
TextureMapFormat register. The range of widths supported by this technique 
are tabulated below, together with the values for each of the PP fields. 

Table B-1. Partial Products 



Screen Width 


PP2 


PP1 


PPO 














32 








1 


64 





1 


1 


96 


1 


1 


1 


128 


1 


1 


2 


160 


1 


2 


2 


192 


2 


2 


2 


224 


1 


2 


3 


256 


2 


2 


3 


288 


1 


3 


3 


320 


2 


3 


3 


384 


3 


3 


3 


416 


1 


3 


4 


448 


2 


3 


4 


512 


3 


3 


4 


544 


1 


4 


4 


576 


2 


4 


4 


640 


3 


4 


4 


768 


4 


4 


4 


800 


1 


4 


5 


832 


2 


4 


5 


896 


3 


4 


5 



B-1 



Screen Widths Table 



Table B-1. Partial Products(Continued) 



Screen Width 


PP2 


PP1 


PPO 


1024 


4 


4 


5 


1056 


1 


5 


5 


1088 


2 


5 


5 


1152 


3 


5 


5 


1280 


4 


5 


5 


1536 


5 


5 


5 



Note that the TVP401 supports a maximum screen resolution of 1 536 in width 
and 1024 in height. 
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Appendix C 



Glossary 




accumlation buffer: A color buffer of higher resolution than the displayed 
buffer (typically 1 6bits per component for an 8bit per component display). 
Typically used to sum the result of rendering several frames from slightly 
different viewpoints to achieve motion blur effects or eliminate aliasing 
effects.: 

activefragment: A fragment which passes all the various culling tests, such 
as scissor, depth(Z), alpha, etc., is written to/combined with the 
corresponding pixel in the framebuffer. See also "fragment" and "passive 
fragment". 

aliasing: A phenomena resulting from a rendering style which ignores the 
fact that a pixel may not be wholly covered by a primitive, leading to 
jagged edges on primitives. 

alpha blending: The ability to combine supplied Red, Green and Blue color 
values with those that exist in the framebuffer according to the supplied 
alpha value. Alpha blending forms the basis for techniques such as 
transparency and painting. 

alpha buffer: A memory buffer containing the fourth component of a pixel's 
color in addition to Red, Green and Blue. This component is not 
displayed, but may be used for instance to control color blending. 

area stipple: A two dimensional binary pattern which is used to cull 
fragments from being drawn. 



B 



bitblt: Bit aligned block transfer. Copy of a rectangular array of pixels in a 
bitmap from one location to another. 

bitblt double buffering: A technique to provide independent windowed 
double buffering by biting an area from one buffer to the other. 
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bitplane double buffering: A technique whereby fast independent 
windowed double buffering can be achieved by using a single bitplane 
bit. 



block write: A feature provided in some memory devices such as VRAM 
and SGRAM which allows multiple pixels to be set to a given value by a 
single write. Fast fill is an alternative name for this feature. 



chroma keying: Also known as bluescreening, this is the practice of 
excluding color from an image allowing an underlying image to show 
through. 

chroma test: The means by which chroma keying can be achieved. 

color index: The mode in which the color information is stored for each pixel 
as a single number, the color index rather than as separate Red, Green, 
Blue and optionally Alpha values (RGBA mode). Each color index 
references an entry in a color look up table that contains a particular set 
of Red, Green and Blue values. 

command register: A register which when loaded triggers activity in 
TVP401 0. For instance the Render command register when loaded will 
cause TVP4010 to start rendering the specified primitive with the 
parameters currently set up in the control registers. 

context: The state information associated with a particular task. Typically in 
a system more than one task will be using TVP401 to render primitives. 
Software on the host must save away the current contents of the 
TVP4010 control registers when suspending one task to allow another 
to run , and must restore the state when that task is next scheduled to run. 

control register: A register which contains state thatdictates how TVP401 
will execute a command. 

culling: The process of eliminating a fragment, object face, or primitive, so 
that it is not drawn. 



DDA: Digital Differential Analyser. An algorithm for determining the pixels to 
draw along a line or polygon edge. Also used to interpolate linearly 
varying values such as color and depth. 

delta: A gradient of color, fog, depth etc. in the X or Y directions for a 
primitive. 



Glossary 



depth (Z) buffer: A memory buffer containing the depth component of a 
pixel. Used to, for example, eliminate hidden surfaces. 

depth-cueing: A technique which determines the color of a pixel based on 
its depth. Used, for instance, to fade far away objects into the 
background. Also known as fogging. 

dithering: A rendering style which increases the perceived range of 
displayed colors at the cost of spatial resolution. The technique is similar 
to the use of stippled patterns of black and white pixels, to achieve 
shades of grey on a black and white display. 

dominant edge: The side of a primitive such as a triangle, which has the 
greatest range of Y values. 

double-buffering: A technique for achieving smooth animation, by 
rendering only to an undisplayed back buffer, and then swapping the 
back buffer to the front once drawing is complete. 



E 



extent checking: A technique which determines the rectangular bounds of 
the area which has been rendered to. 




fast fill: A feature provided in some memory devices such as VRAM and 
SGRAM which allows multiple pixels to be set to a given value by a single 
write. Block write is an alternative name for this feature. 

flat shading: The constant color shading or area filling of a primitive. 

fogging: A technique which determines the color of a pixel based on its 
depth. Used, for instance, to fade far away objects into the background. 
Also known as depth-cueing. 

fragment: A fragment is an object generated as a result of the rasterization 
of a primitive. It corresponds to and contains all the components of a 
single pixel. If a fragment passes all the various culling tests, such as 
scissor, depth(Z), stencil, etc., it will be written to/combined with the 
corresponding pixel in the framebuffer. 

framebuffer: An area of memory containing the displayable color buffers 
(front, back, left, right, overlay, underlay), their (optional) associated 
alpha components, and any associated (optional) window control 
information. This memory is typically separate from the localbuffer. 
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Glossary 



G 



Gouraud shading: The technique of variable color shading or area filling of 
a primitive using interpolation to gradually vary the color between 
vertices. Often known as smooth shading. 



H 



hardware writemask: A bitmask implemented in memory devices such as 
VRAM and SGRAM to enable or inhibit the writing of the corresponding 
bits of a fragment's color into the framebuffer. 

host: The processor which controls TVP401 0. 




localbuffer: An area of memory which maybe used to store textures and/or 
non-displayable depth(Z) and/or stencil pixel information. This memory 
is typically separate from the framebuffer. 

logic ops: The technique of applying logical operations such as OR, XOR 
or AND to the fragment color values and/or those in the framebuffer. 

LUT: A look-up-table. This normally contains color values to allow mapping 
from an index value to the desired Red, Green and Blue value. 




overlays: The technique of ensuring certain drawn objects always remain 
foremost in view and not obscured by others. Historically this was one 
method of providing a cursor and was usually achieved by providing 
extra bit planes. 




packed data: The arrangement of data in a buffer which allows multiple 
pixels to be read or written in a single access. 

passive fragment: A fragment which fails one or more of the various culling 
tests, such as scissor, depth(Z), stencil, etc., is nor written to/combined 
with the corresponding pixel in the framebuffer. See also "fragment" and 
"active fragment". 
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patched addressing: A technique whereby data is organized in memory 
such that there is improved performance for accesses to adjacent 
scanlines in a buffer. For TVP4010, this is available for depth and/or 
stencil buffer accesses. For textures a special form, subpatch 
addressing is provided. 

picking: A means of selecting drawn objects or primitives. 

preMult: A method of alpha blending, also known as Ramp blend mode, 
used by QuickDraw3D. 

pixel: Picture element. A pixel comprises the bits in all the buffers (whether 
stored in the localbuffer or framebuffer), corresponding to a particular 
location in the framebuffer. 

primative: A geometric object to be rendered. The TVP401 primitives are 
points, lines, trapezoids (including triangles as a subset), and bitmaps. 



R 



Ramp blend mode: A method of alpha blending, also known as preMult, 
used by QuickDraw3D. 

rasterization: The act of converting a point, line, polygon, or bitmap, in 
device coordinates, into fragments. 

rendering: Conversion of primitives in object coordinates into an image. 



s 



scissor test: A means of culling fragments which lie outside the defined 
scissor rectangle. The scissor rectangle is defined in device coordinates. 

software writemasking: A means of simulating hardware writemasking by 
performing a read-modify-write operation on framebuffer data. 

stencil buffer: A buffer used to store information about a pixel which 
controls how subsequent stencilled fragments at the same location may 
be combined with its current value. Typically used to mask complex 
two-dimensional shapes. 

stipple: A one or two dimensional binary pattern which is used to cull 
fragments from being drawn. 

subordinate edge: The sides of a primitive such as a triangle, which do not 
have the greatest range of Y values. 
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subpatch addressing: A technique whereby data is organized in memory 
such that there is improved performance for accesses to adjacent 
scanlines in a buffer. For TVP4010, this particular form of patched 
addressing is available for accessing texture maps. See also Patch 
Addressing. 

subpixel correction: A means of ensuring that all interpolated parameters 
associated with a fragment (color, depth, fog, texture) are correctly 
sampled at the fragment's center. This is required, for example, to ensure 
correct color shading of objects comprised of many primitives. 



tag: The data item that uniquely identifies a Graphics Core register. 

task: A process, or thread on the host which uses the TVP4010 
co-processor. Typically tasks assume that they have sole use of 
TVP401 and rely on a device driver to save and restore their TVP401 
context, when they are swapped out. 

texel: Texture element. An element of an image stored in texture memory 
which represents the color of the texture to be applied (fully or in part) to 
a corresponding fragment. 

texture: An image used to modify the color of fragments during processing. 
Often used for instance to achieve high realism in a scene, with relatively 
few primitives. 

texture mapping: The process of applying a two dimensional image to a 
primitive. For instance to apply a wood grain effect to a table. 



writemask: A bit pattern used to enable or inhibit the writing of the 
corresponding bits of a fragment's color into the framebuffer. See also 
Software Writemask and Hardware Writemask. 



YUV: An alternative color format to RGB, also known as YCbCr. Color format 
used by MPEG. 



Z buffer: An alternative name for the depth buffer. 
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back buffer, 2-21 , 3-5 
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Chroma Test, 6-13 
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lndex-1 



Index 



Clears, 6-6 

Color, 2-11,2-16, 3-8, 4-28, 4-63, 4-77, 4-88, 4-89, 

5- 6, G-13, G-67, G-98, 8-4, 8-6, 8-11 
Color buffers, 3-5 

Color DDA, 4-21 

Color DDA unit, 4-2, 4-58, 4-68, 4-69, 4-70, 4-73, 
4-78, 4-80, G-9, G-11, G-13, G-14, G-15, G-21, 
G-22, G-28, G-29, G-39, G-40, G-72, G-100, 8-4 
Color Format, 4-69, 5-7, G-4, G-5, G-31, G-32 
color format, 3-7, 3-8, 4-21 , 4-54, 4-55, 4-58, 4-63, 
4-79, 4-82, 5-2, 5-7, 6-9, 6-11, G-3, G-13, G-15, 
G-126 

Color Format Examples, 4-82 
Color Format Unit, 4-81 

Color Format unit, 2-24, 4-2, 4-63, 4-77, 4-81 , 5-7, 

6- 11, G-13, G-30, 8-5 
Color Formats, 3-9, 4-81 
color formatting, 4-62, 4-64 

Color Index, 1-2, 3-8, 4-54, 4-68, 4-81, G-13, G-15 
color index, 3-18 
Color Interpolation, 4-6 

color interpolation, 3-10, 3-11,4-5, 4-7, 4-69, 4-70 
color order, 3-9, 4-55, G-4, G-31, G-32, G-127, 
G-128 

ColorDDAMode, 4-70, 4-71, 4-80, 5-10, G-14, G-15, 

8-4, 8-6, 8-11 
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